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GENE EXEBESSION IN BLADDER TUMORS 



This application claims the benefit of U.S. Provisional Application 
No. 60/121,124, ffled Fdjruary 22, 1999, which is hereby incorporated by 
refi^ence in its mtirety. 

5 TF.^^HNTCAIL FIELD OF THE INVENTIQM 

This inv^on is related to ttie field of cancer diagnosis and treatment. 
In particular it is rdated to the use of gene expression to categorize and detect 
tumors. 

BAQCGROUND OF, THE imTSNTIQN 

10 The building of large databases containing human genome sequences 

is the basis for studies of goie expressions in various tissues during normal 
physotogkal and patholo^ coadStioas. Constantly (constitutively) expressed 
sequences as well as sequences whose expression is altered during disease 
processes are important for our understanding of cellular properties, and for 

15 the id^itification of candidate g^es for foture therapeutic intervention. As the 
number of knovm genes and ESTs build up m the databases, array-based 
simultaneous screening of thousands of genes is necessaiy to obtain a profile 
of transCTiptional behaviour, and to identify tey genes that, dtiier alone or in 
combination wrdi other genes, control various aspects of cellular life. One 
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Still another object of the invention is to provide a method of 
identifying a tissue sample as urothelial. 

Yet another object of the invention provides a method of determining 
an expression pattern of a bladder tissue sample independent of the proportion 
5 of submuOTsal, muscle, and connective tissue cells present 

These and other objects of the invention are achieved by providing 
one or more of the embodiments d^bed bdow. In one embodiment a 
method is provided of determining an expression pattern of a ceU sample 
independent of the proportion of submucosal, smooth muscle, or connective 
10 tissue cdls present. Ejcpression is det«inmed of one or more genes in a sample 
comprising cells. The one or more g^es exclude genes which are ejqjressed 
in the submucosal, muscle, and connective tissue. A pattern of expression is 
formed for the sample which is independent of the proportion of submucosal, 
muscle, and connective tissue cells in the sample. 
15 In another aspect of tiie invention a metiiod of detennining an 

expression pattern of a cell sample is provided. Expression is determined of 
one or more genes in a sample comprising cells. A first pattern of expression 
is thereby formed for the sample, G^nes which are expressed in submucosal, 
smootii nmscle, or connective tissue ceils are rraioved from the first pattern of 
20 expressiOT, forming a second pattern of epcpressaon which is independent of the 
proportion of submucosal, smooth muscle, or connective tissue cells in the 
sample. 

Another embodiment of tiie invention provides a method for 
detennfaiing an expression pattern of a urothelium or bladder cancer cell. 

25 Expression is determmed of one or more genes in a sample comprising 
urothelium or bladder cancer cells; the expression determmed fonns a first 
pattern of expression. A second pattern of ejqjression which was formed using 
the one or more genes and a sample comprising predominantly submucosal, 
smooth muscle, or connective tissue cells, is subtracted &om the first pattOT 

30 of expression, formmg a tidrd paUem of ejqpression. The tiiird pattern of 
expression reflects expression of tiie urothelium or bladder cancer cells 
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indq>endent of the proportion of submucosal, smooth muscle, or connective 
tissue cells present in the sample. 

In another erabodim^it of the invention a method is prodded of 
detecting an invasive tumor in a patient. A maricer is detected in a sample of 
5 a body fluid. The body Add is selected from the group consisting of blood, 
plasma, serum, urine, ascites fluid, pleural fluid, spinal fluid, sputum, and 
mucous secretions. The marker is an mRNA or protein expresaon product of 
a gaie which is more prevalent in submucosal, smooth muscle, or connective 
tissue than in the body fluid. An increased amount of the marker in the body 
10 fluid indicates a tumor which has become invasive in the patient. 

In anoth^ aspect of tiie invention a method is provided for diagnosing 
a bladder cancer. A first pattern of expression is determined of one or more 
genes in a bladder tissue sample suspected of being neoplastic. The first 
pattern of expression is compared to a second and third reference pattern of 
15 expression. The second pattern is of the one or more genes in normal 
urothdium and the third pattern is of the one or more genes in bladder cancer, 
A first pattern of expression which is found to be more similar to the third 
pattern than the second indicates neoplasia of the bladder tissue sample. 

According to yet another aspect of the invention a method is provided 
20 for predicting outcome or prescribing ^tment of a bladder tumor. A first 
pattern of expression is determined of one or more genes in a bladder tumor 
sample. The first pattern is compared to one or more reference patterns of 
^ression detmnined for bladder tumora at a grade between I and IV. The 
reference pattern which shares maximum similarity with the first pattern is 
25 identified. Theoutcomeortreatmentappropriateforthegradeoftumor of the 
referOTce pattmi vnth the raaxhnum sindlanty is assigned to the bladder tumor 
sample. 

In another embodiment of tiw invention a method is provided for 
determining grade of a bladder tumor. A first pattern of expression is 
30 detOTtmed of oi^ or more genes in a bladder tumor sample. The first pattern 
is compared to one or more reference pattwis of expression determined for 
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bladder tumors at a grade between I and IV. The reference pattern which 
shares maxinumisimaaiity with the first pattern is identified. The grade of the 
reference pattern with the maximura siniUarity is assigned to the bladder tumor 
santpie. 

Yet another embodiment of the invention provides a method to 
determine stage of a bladder tumor. A first pattern of expresaon is 
detairined of one or more genes in a bladder tumor sample. The first pattern 
is compared to one or more reference patterns of esqjression detemnned for 
bladder tumors at different stages. The reference pattern which shares 
maximum similarity with the first pattern U identified. The stage of the 
reference pattern with the maximum similarity is assigned to the bladder tumor 
sample. 

In still another embodiment of the invention a method is provided for 
identifying a tissue sample as urotheUai. A first pattern of expression is 
determined of one or more genes in a tissue sample. The first pattern of 
expression is compared to a second pattern of repression obtained fi-om nonnal 
urothdiai cells. Similarity between the first and second patterns identifies the 
tissue sample is urothelial in its origin. 

Another aspect of the invention is a method to identify a set of genes 
usefid for diagnosing, predcting (sitcome, or prescaibing treatment of a bladder 
cancer. A first pattran of expression is determined of one or more genes in a 
first bladder tissue sanqde. A second pattern of expression is determined of the 
one or more genes in a second bladder tissue sample. The first bladder tissue 
sample is a nonnal urotheUum sample or an eariier stage or lower grade of 
bladder tumor than the second bladder tissue sample. The first pattern of 
expression is compared to the second pattern of expression to identify a first 
set of genes whose esqjressiwi is increased or decreased in the second bladder 
tissue sample relative to the first bladder tissue sample. Those genes which are 
expressed in submucosal, smooth muscle or connective tissue are removed 
fix)m the first setrof gaies to produce a second set of genes. Expression of the 
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second set of genes can be used for diagnosing, predicting outcome, or 
prescribing treatment of a bladder cancer. 

Acconfing to yet another aspect of the invention a method is prodded 
for determining an expression pattern of a bladder tissue sample independent 

5 of the proportion of submucosal, smooth muscle, or connective tissue cells 
present A single-ceU suspension of disaggregated bladder tumor ceUs is 
isolated fixim a bladder tissue sample comprising bladder cells, submucosal 
ceUs, smooth muscle ceUs, or connective tissue cells. The expression of one 
or more genes in the single-cell suspenaon is determined. A pattern of 

10 expression is thus formed for the sample which is mdependent of the 
proportion of submucosal, smooth muscle, or connective tissue cells in the 

bladder tissue sample. 

According to still anotha- aspect of the invention a method is provided 
for soeening compounds to identify candidate therapeutic agents for treating 

15 bladder cancer. Bladder tumor ceUs are contacted with a test compound. 
Gene expresaon of one or more genes is determined in the bladder tumor cells 
which have been contacted with the test compound. The one or more genes 
are ones whose expresaon changes durii^ the development of a bladder 
cancer. A test compound is identified as a candidate therapeutic agent if it 

20 causes gene expression of at least one of the one or more genes to change to 
a levd wWch is characteristic of an earlier stage of cancer progression. 

The present faivaition tiius provides the art witii numerous methods for 
moleculariy assessing bladder cdls. The methods aid the art in di^osing, 
identi^rag, classifying, treating, detecting, and treating tumors of tiie bladdw. 

25 m W^ i^TT.^^rRTimnN OF THF, PKAWTNCrf^ AND TABLED 

Figure 1 shows a disti&mion of exjH^ssion levels in bladdw wall tissue 
expressed in arbitrary units. Only genes scored as present or marginally 

present are shovim. 

Figure 2-^ws a comparison of intenaty tUffeences betsveen a tumor 
30 and a pool of tumors of tiie same stage and grade, and two pools of differwit 
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Stages and grades. The differraice is lai^a between the pools, demonstrating 
the vaBdily of using expression patterns to deteraime stage, grade, prognosis, 
and treatment regimen. 

M genes scored as present on two chips {appro»mateiy 9000 genes) 

5 were compared. Increased gaies woe compared to mcreased genes and 
decreased genes to decreased genes, on two sq)anite chips, followed by 
plotting of the numerical dififaence of the medians. A TaGrH tumor conqwred 
to the TaGrH pool, and the TaGrn pooi to T2GrIV pool (1, 2, 3, 4); T2Grin 
to T2Grni pool, and T2Gtin pool to T2GrIV pool (5, 6. 7, 8); T2GrIV to 

10 T2GrIV pool, and T2GrIV pool to TaCMI pool (9, 10. 1 1, 12). Paired T-test 
of medians showed a borderline significant difference, with pool vs. pool 
scoring highest (P-value of 0.07). 

Figure 3 shows progression of a bladder cancer ftom normal urothelium 
to invasive grade IV tumor. The expression patterns change during the 

15 progression, with a great variation in pattern from stage to stage, but also 
within a stage and even within tumors having the same stage and grade of 
atypia. 

Figure 4 shows the correlation between transcript levels from genes 
expressed in at least one sample. Fig. 4A demonstrates the repeatability of 
20 microarrayexpresaon analysis. Duplicate determinations on a normal sample 
ai« compared. Rg. 4B is a plot of a pTa tumor vs. the normal pool. Fig. 4C 
is a plot of an invaave pT2 tumor versus the normal pool. The vast majority 
of tramcripts are presMit at similar levels in both normal and tumor tissue. 
Figure 5 shows dendrograms of tissues based on different clustering 
25 methods. Qustering was ehher based on log-fold change in expression level 
of genes (Figs. 5A, 5C), or the absolute difference (Rgs. SB, 5D), comparing 
tumor to a pool of normal samples. Genes used for clustering were either 
diose 10% of the genes that covaried best whh progression (A, B), or all 4076 
graies that were scored as present in at least one sample (C, D). 
30 Figure 6A through 6F show how the psttwn of expression changes 

during progresaon of bladder canca: based on levels of transcripts in pools of 
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nonnal biopsies, superficial pTa tumors grade n, and invasive pT2+grade IV 
tumora. The curve at the top left portion of each subfigure shows the direction 
of change in gene expression based on pools of normal urothelium (open 
circle), superficial pTa tumor (gray cirde) and invasive pT2+ tumor (black 
5 circle). Fold change in gene expression level was calculated on a probe-to- 
probe basis using 20 probes per gene and eliminating the highest and lowest 
outiiers (olympic scoring). It is noteworthy that reduced expression is the most 
coratnon event. 

Figure 7 shows a duster diagram of 9 bladder tumors representing the 
10 progression of bladder cancer. Each column represents atumor preparation, 
and each row a g^e. The diagrams show dustering based on log-fold change 
from nonnal urotheUum (left diagram) and based on absolute difference from 
nonnal urothelium (right diagram). A decreased expression is displayed as 
shades of cyan, an increased ejqjression as shades of yellow, and no change as 
15 bladt. 

The dendrograms at each ade show the relation between the different 
genes. In the middle, distinct fiinctional clusters are identified and members of 
the dusters are annotated in brief (for fidl length ID of all genes in the diagram 
and Genbank numbers see www.mdl.dk/supplementaiy data). In an effort to 
20 identify those genes most indicative of cancer progression a weighting scheme 
was used to sdect the 400 genes that covaiicd best with the different stages of 
bladder cancer. Gene dustering was based on normalized Euclidean distance 
(vector angle) calculated between goies or gene duster centers. 

Fipire 8 shows the vector angle between pools and individual single 
25 cell preparations or biopsies. The numbers refer to patient samples and the 
stage ofeach is indicated. Pools are identified as follows: squares, Ta grade 
I pool; diamonds, Ta grade H pool; drdes, T2-4 grade m pool; triangles. T2-4 
grade IV pool. The lowest angle for each sample determines the whether the 
sample is classified as a superfidal (Ta or Tl) or muscle-invasive tumor (Tl- 
30 T4). 
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Figure 9 shows a comparison of Northern blots and oligonucleotide 
arrays. The samples analyzed were nonnai pool (Norm), superficial pTaGrI 
tumor (335), rainimaHy invasive pTl grade HI (901), and invasive pT2 grade 
m (713). The Northern blots were scanned by densitometry and plotted (solid 
5 lines) together with a plot of the level detected on the arrays (dotted lines). 
The levels of expression ranged from 3-6,000 units (beta -2 microglobulin) to 
100-600 uiits (E-cadherin). The level of transcripts d^ected was similar with 
both methods. 

Figure 10 shows immunohistocheniicai stainmg of the tissue sections 
10 used for expresaon analysis. On each section the protdn examined is indicated 
and the level measured on the oUgonudeotide array. Arrows indicate stained 
urotheUal cells in cathepan E (297 arbitrary units), ApoE (389 units), and 
CD59 (260 units) stainings, and stained stromal cells or leukocytes in beta2- 
microgiobulin (2481 units) and cystatin C (941 units). 
15 Figure 11 presents a model of gene expression evente during the 

progression of bladder cancer. The top of the figure shows the stages of 
bladder cancer, and the lower part shows the sequence of transcriptional 
events. The color cyan identifies reduced expression, yeUow increased 
expression (also mdicated by arrows). The figure is based on data from cluster 
20 analysis, and combines the different cluster methods. 

Table 1 shows genes which were highly expressed in bladder wall. 
Expression is shovra in "connective tissue" which includes muscle and 
submucosal cdls, a TaGr in, a TjGrin, and a TjGt IV bladder tumor. Genes 

above the 90* percentae arts grouped according to the purported fimction of 

25 the protein. 

Table 2A shows Wgh iiaensity genes in bladder wall compared to single 

cell solutions and biopsies of tumors. 

Table 2B shows expression of genes related to bladder wall. 
Table 3 shows the number of genes that are expressed as in the tumor- 
30 pool to which the tumor belongs, or altered as in a tumor pool of Mgher or 
lower stage or grade. 
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Several of the tables described above contain Usts which include both 
genes and expressed setpence tags (ESTs). Reference to the "genes" of a table 
shall be understood to indude the gene containing the EST. 

jfYTAXXim nESrwTTmnN of TfW. TNVEWT I QW 

It is a discovery of the present inventor that characteristic pattenis of 
expiesaon of genes canbeusedto diaractcrize different types of tissue. Thus, 
for example, gene expression patterns can be used to characterize stages and 
grades of bladder tumors. Similarly, gene expression patterns can be used to 
distinguish cells having a bladder origin from other ceUs. Moreover, gene 
expression of ceUs which routmely contaminate bladder tumor biopsies has 
been identified, and such gene expression can be removed or subtracted from 
patterns obtained from bladder biopsies. Further, the gene expression patterns 
of single-cell solutions of bladder tumor ceUs have been found to be fsr freer 
of interfering expression of contaminating muscle, submucosal, and connective 
tissue cells than biopsy samples. 

Working with human tumor material requires biopsies, and working 
witiiBNArequirrafre^y frozen or immediately processed biopsies. Biopsies 
mevitably contain many afferent ceU types in addition to cancer cells, such as 
cells present in blood, connective and muscle' tissue, endothelium etc. In the 
case ofDNA studies, raicrodUsection or laser capture are methods of choice, 
however, the time^i^jendent degradation ofBNA makes it difficult to perform 
manipuJation of the tissue for more than a fisw minutes. Furthennore, studies 
of expressed sequences may be difficult on the few cells obtained via 
microdissection or laser capture, as these may have an expression pattern that 
deviates from the predominant pattern in a tumor due to intratumoral 
heterogeneity. 

High-density expression arrays were used to evaluate the impact of 
bladder waU components in bladder tumor biopsies, and tested preparation of 
single ceU solutions as a means of eliminating the contaminants. The results of 
these evaluations permitted the deagn of methods of evaluating bladder 
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samples without the interfering background noise caused by ubiquitous 
contanunaling submucosal, muscle, and connective tissue cells. 

The evaluating assays of the invention may be of any type. WWle Wgh- 
density expression arrays can be used, other techniques are also contemplated. 
These include other tedmiques for assaying for specific mRNA species, 
including RT-PCR and Northern Blotting, as weU as techniques for assaying 
for particular protein products, such as EUSA, Western Blotting, and «izyme 
assays. Gene expression patterns according to the present invention are 
detsmined by measuring a gene product of a particular gene, including mRNA 
and protein. A pattern may be for one or more genes. 

Using the results provided in the accompanying figures and tables, a 
gene is indicated as being expressed if an intensity value of greater than or 
equal to 21 is shown. Conversely, an intensity value of less than 21 indicates 
that the gene is not expressed above background levels. Comparison of an 
expression pattern to another may score a change from expressed to non- 
expressed, or the reverse. Alianativdy, changes in intensity of expression may 
be scoi^l, either increases or decreases. Any statistically significant change 
can be used. Typically changes which are greater than 2-fold are suitable. 
Changes which are greater than 3-fold or 5-&id are highly significant. 

Apattem of characteristic expression of just one gene can be usefiil in 
characterizing a ceU type source or a stage of disease. However, more genes 
may be usefully analyzed. Usefiil patterns include expression of at least one, 
two, three, five, ten, fiftetsi, twenty, twenty-five, fifty, seventy-five, or one 
hundred informative genes. As used herein, the phrase "stage-specific 
reference pattern" refers to a pattern of gene expresaon characteristic of a 
given stage of progression ia a bladder tumor. A stage-specific reference 
pattOTi can include one or more genes listed in Table 4 and/or one or more 
genes Usted in Table 8 and/or one or more genes listed in Table 9 and/or one 
or more g^es listed in Fig. 6. 

RNA orprotan can be isolated and assayed firora a test sample using 
any techniques known in the ait They can, for example, be isolated from fiesh 
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or frozen biopsy, from fbmalm-fixed tissue, from body fluids, such as blood, 

plasma, serum, urine, or sputum. 

Stage of a bladder tumor indicates how deeply the tumor has 
penetrated. Superficial tumors are termed Ta, and T,^ are used to describe 
increasing degrees of penetration into the muscle. The grade of a bladder 
tumor is expressed on a scale of I-W (M). The grade reflects the cytolo^cal 
appearance of the ceBs. Grade 1 ceUs are almost normal. Grade H cells are 
sUghtly deviant. Grade IH cells are deariy abnormal. And Grade IV cells are 
highly abnormal. 

As used herein, the reference to genes which are expressed in 
"submucosal, smooth muscle, or connective tissue" or patterns of expression 
in "other cell types" can include the expression of one or more of the genes 
Usted in Table 1 and/or one or more of the genes listed in Table 6. The term 
"connective tissue ceU" includes any stromal ceU such as fibroblasts, 
macrophages, mast cells, granulocytes, etc. The data provided herein of 
expression for submucosal, smooth muscle, and connective tissue can be used 
in at least three ways to improve the quaUty of data for a tested sample. The 
genes identified in the data as expressed can be excluded from the testing 
altogether or tested but eliminated from the analysis. Alternatively, the 
intensity of expression of the genes expressed in the submucosal, smooth 
muscle, and/or connective tissue can be subtracted from the intensity of 
expression determined for the lost tissue. 

Patterns can be compared raanuaUy (by a person) or by a computer or 
other machnie. An algoiithra can be used to detect amilarities and (Hfferences. 
The algorithm may score and compare, for example, the genes which are 
expressed and the genes wiach-are not expressed. Alternatively, the algorithm 
may look for changes m mtensity of expression of a particular gene and score 
changes in intensity between two samples. A variety of such algoiiflims are 
known in the art. Similarities may be determined on the basis of genes which 
are expressed in both samples and genes which are not expressed in both 
samples or on the basis of genes whose intensity of expression are numerically 
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sfanaar. Differences are considered significant when they are greater than 2- 
fold, 3-foid or S-foId from the base value. Alternatively, a mathematical 
approach can be used to conclude whethw differences in the gene expression 
exhibited by different san^les is significant (see. e.g., Golub et al.. Science 286, 
53 1 (1999). One approach to determine whether a sample is more simUar to 
or has maximum similarity with a given condition (e.g.. a particular grade or 
stage of tumor progression) is to compare the EucUdean distances (see CSohib 
et al. and Example 6) between the sample and one or more pools representing 
di^t conditions for comparison; the pool with the smallest vector angle is 
then chosen as the most similar to the test sample among the pools compared. 

The data collected and disclosed here as "connectwe tissue" is 
presumed to contain both smooth muscle and submucosal gene expresaon as 
weU. Thus it represents the composite expression of those ceH types which can 
typically contaminate a bladda- tumor biopsy. 

Genes identified as changing in various stages or grades of bladder 
cancer can be used as markers for drug screening. Thus by treating bladder 
cancer cells with test compounds or extracts, and monitoring the expression 
of genes identified as changmg in the progression of bladder cancers, one can 
identify compounds or extracts which change expression of genes to a pattern 
which is of an earUer stage/grade or even of normal urotheUum. As used 
her«n,thereferencetoexpre5aonofgene3 in "normal urothelium" or "normal 

urothdial cells" can include one or more genes listed in Table 7. 

As demonstrated below, pools of tumors of a similar stage or grade, 
particulariy bladder tumors, can be made and the expression of the pool 
evaluated. The expression data of the pool can be used to define a stage or 
grade of tumor. The use of the pool minimizes the variations found from 
individual tumor to individual tumor of the same grade or stage. The 
expression data of the pool can then be used as a comparator to wMch 
individual tumor samples are compared, in an effort to categorize, 
prognosticatcandprescribetheturaorsamples. The methods described herem 
for classifying the stage or grade of a tumor can be combined with sequence 
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analysis of genes vA^so expression is altered compared to normal tissue in the 
Individual patient. In particular, mutations in key genes such as tumor 
suppressor genes can hdp to refine the appUcation of the gene expression 
results to diagnosis and prognosis. 

As used herein, the refeence to expression of "genes in bladder cancer" 
or "genes in a bladder tumor" can include one or more gwes listed in Table 4 
and/or one or more of the g^es listed m Table 8 and/or one or more genes 
listed in Table 9. 

The above disclosure generally desaibes the present invention. A more 
complete understanding can be obtained by ref^-ence to the following specific 
examples which are provided herem for purposes of illustration.only, and are 
not intended to limit the scope of the invention. 

Quantification of gene expression using micr<Kitrays 
Material 

Bladder tumor biopsies were sampled fi"om patients after informed 
consult was obtained, and after removal of the necessary amount of tissue for 
routine pathological examination. Tumors examined were 335 (stage pTa 
gradel), 837 (pTa GrII), 901 (pTa GrlE), 320 (pTl GrlE), 713 (pT2 Grill). 
RNAfi^m six diflferent tumors of the same stage and grade was combined to 
form each pool Four sudi pools were prepared (pTa GrI pool, pTa Grll pool, 
pT2+ Grin pool, and pT2+ GrIV. pool), Nonnal bladder mucosa biopsies from 
36 patients with prostatic hyperplasia or incontinence were pooled (as RNA) 
to obtain a nonnal urothelial reference. Single cell solutions were made by 
diant^rating biopaes on ice with a scalpel and a syringe followed by filtering 
through a 50 micron filt^. 

Pr^pacsition gfmRNA 

Total RNA was isolated using the KNAzol B RNA isolation method 
(WAKXhemie Medical GMBH). Poly (A>^RNAwas isolated by an oligo-dT 
selection step (Oligotex niRNA kit from Qlagen). 



15 



pr^p^ratinn nf cTtNA. 

One ^g iriRNA was used as starting material for the cDNA preparation. 
The first and second strand cDNA synthesis was performed using the 
Superscript Choice System (Life Technologies) according to the 
manufecturer's instructions, except that an oligo-dT primer contaimng a T7 
RNApolymerasepromotersitewasused. Labeled cKNA was prepared usmg 
the MEGAscrip In Vrtro Transcription kit (Ambion). Biotin labeled CTP and 
UTP (Enzo) was used in the reaction together with unlabeled OTP's. 
FoUowing the IVT reaction, the unincorporated nucleotides were removed 
using RNeasy columns (Qiagen). 
/\ n -a v hvhridi7ation nnd sff snning 

Ten fig of cKNA was fragmented at 94" C for 35 min. in a 
fragmentation buffer containing 40mM Tris-acetate pH 8. 1, lOOmM KOAc, 30 
mM MgOAc. Prior to hybridization, the fragmented cRNA in a 6xSSPE-T 
hybridization buffer (IMNaCl, lOmMTris pH7.6, 0.005% Triton) was heated 
to 95-C for 5 min and subsequently to 40°C for 5 min before loading onto an 
Affymetrix probe array cartridge (HuGeneFl set array, part No. V900160). 
The probe array was then incubated for 16 h at 40X at constant rotation (60 
rpm). The washing and staining procedure was performed in the Affymetrix 
Huidics Station. The probe array was exposed to 10 washes in 6xSSPE-T at 
25X followed by 4 washes in 0.5xSSP&-T at 50°C. The biotinylated cRNA 
was stained with a sti^tavidin-phycoetythrin conjugate, 10 }ig/ml (Molecular 
Probes, Eugene, OR) in 6xSSPE-T for 30 min at 25 "C followed by 1 0 washes 
m 6xSSPE-T at 25°C. The probe arrays were scanned at 560 nm using a 
confocal laser scanning microscope with an argon ion laser as the exdtation 
source (made for A^metrix by Molecular Dynamics). The readings from the 
quantitative scanning were analyzed by the Affymetrix Gene Expression 
Analysis Software. 

To compare samples, normalization of the data was necessary. For that 
purpose we compared scaling to total GAPDH intenaty (sura of 3', middle, 



16 



5"pix*e sets) of 7000 units with scaling to a total chip intensity (global scaling) 
of 281850 units (averaging 150 units per probe set). Both gave siraUar results 
with scaling factors that diflfered less than ten percent in a set of escperiments. 
Based ontWs we chose the global scaling for aUejcperiments. The variation in 
hybridization inteisrty following global scaling in biopsies from the bladder wall 
is shown in Fig. 1. 

P.yAMPLE 2 

Interference of bladder wall components on aq^ression profiling of bladder 
tumor biopsies 

Biopsies contain epithelial ceUs that most often are the targets for the 
studies {e.g., in the identification and characterization of carcinoma ceUs), and 
in addition many other cells that contaminate the epitheUal cell fraction to a 
varying extent. The contaminants include histiocytes, endotheUal ceUs, 
leukocytes, nerve ceUs, muscle cells etc. Microdissection is the method of 
choice for DNA examination, but in case of expression studies this procedure 
is difficult due to RNA degradation during the procedure. Our approach has 
been to gently remove the epitheUum and monitor the ©cpression in the 
remaining subraucosa and underlying connective tissue (the bladder wall). 
Genes expressed at high or low levels in the bladder wall should be 
interrogated when perfonning expression monitoring of the urothelium and 
urothefiai tumors. A similar approach could be used for studies of ^itheUa in 
other organs. 

We gently swaped off some of the normal urothdium lining the bladder 
lumen from bladders removed at cystectomy for bladder cancer. Then biopsies 
were taken from the denuded subraucosa and connecUve tissue, reaching 
approximately 5 mm into the bladder wall, and immetKately disint^ated in 
guanidinium isothiocyanate. Total RNA was extracted fixjm four different 
cystectomy specimens, pooled, and poly(A)* niKNA was prepared from the 
pool foUowed by conversion to double-stranded cDNA and in vitro 
transcription into cRNA containing biotin-labded CTP and UTP. 
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The labeled sample was hybridized to a set of 4 atiays containing 7074 
probe sets for human genes. A total of 1491 of the examined genes (21.1%) 
were scored as present, and 120 (1.7%) as present but rare. The percentile 
distribution of the expression intensity was (90%, 1308; 75%, 383; 50%, 163; 
5 25%, 85; 10%, 47). Genes above the 90* percentile (Table I) were grouped 
according to the purported fiinction of the protein (Table 1. first column). 
Many of the highly expressed genes belong to a group of genes that encode 
protdns involved in transcription and translation, probably reflecting that these 
genes generally are highly expressed in the various ceU types present in the 
10 bladder waU, and corresponding to recent data on yeast. Structural proteins 
such as keratins and proline rich proteins are highly expressed whereas coUagen 
genes are only medium expressed. Extremely high expression is shown by the 
cystic fibrosis antigen gene, the SlOO calcium binding protein, the cystatin B 
and the cytokeratin 13 genes that are all above 10,000 units. 
15 To evahiate the influence of bladder wall tissue in urothelial tumor 

biopsies, we monitored the expression level in 3 biopsies from transitional cell 
carcinomas (one superficially invasive (#733-2) and two muscle invasive 
(#733-1 and #879-1). The expression intensity in the tumor biopsies of genes 
thatarehi^yexpressedinbladderwaUarelistedin Table 1. Many genes are 
20 expressed to the same magnitude in the tumor biopsies as in bladder wall, 82 
genes (5%) were present at a level above 1308 in aU samples, and above tiie 
75% percentile of the bladder waU sample intensity (383 units) 210 genes were 
expressed in aU three biopsies as.weU as in bladder waU. Genes tiiat were not 
expressed in bladder wall but present in tiie urotbeBal biopsies amounted to 
25 196. 

Genes that are expressed and genes tiiat are not expressed in bladder 
wall can botii inteifee with the interpretation of the expression in a biopsy, and 
should be interrogated when mterpreting expression intensities in urothelial 
tumor biopsies, as the bladder wall component of a biopsy varies in amount 
30 firom biopsy to biopsy. 
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It 15 remarkable how similarly many genes belonging to the groups 
encoding metabolically active proteins, transcr^tion and translation related 
proteins, mitochondrial and nucleoproteins, are expressed in the different 
samples (Table 1). It seems reasonable to expect that it is incompatible with 
5 cellular function to stray from a narrow interval regarding these genes. 
Although some of the examined cells are malignant of atypia grade IV, which 
is a sev^ morphological deviation from normal, the key cellular functions are 
obviously still und^ strict control 

Twenty sbc gaies wa^ ©pressed at an intensity above 1308 in bladder 
10 wall and more than five times lower in tumor biopsies. These genes, marked 
with bold (Table 1), include keratins (7 genes) encoding proteins like keratins 
type n, 4, and 6. Anoth^ prominent group are the genes encoding proUne-rich 
prxrtdns (5 genes). Th^ gene expressions can be used to monitor the amount 
of bladder wall present in a given biopsy of tumors. The tumor biopsy 879-1 
15 obviously has a larger bladder wall component than the other biopsies, as it 
contains keratin 13 and several other highly exf^ressed bladder wall mRNAs at 
a low level, but higher than the otha: specima:js (Table 1). 

An interesting resuh was the SlOO caldum binding protein A7 gene 
transcript that was hi^ express^ in bladd^ wall and totally absent from the 
20 other biopsies (also absent from a number of other examined tumor biop^es 
from bladder). As all samples were collected with the same procedures, it 
indicates tiiat tiiis expression is either individual and occurroi by chance in the 
patients from whom we rmoved the bladder biopsies, or, more Ukdy, that 
the presence of urothelial RNAases degrade this transcript very &st. 
25 Bladder tumors have a reduced intercellular cohesion, and easily 

disintegrate into single ceil sobtions. To eliminate bladder wall cells from the 
urothelial tumor cells, five tumors were disintegrated mto smgle cell solutions 
before extraction of KNA, and compared to three tumors where RNA was 
extracted from the biopsy directly. We expected that this disintegration 
30 procedure might lead to an enrichment of tumor ceils and loss of connective 
tissue cells. Examination of g^es highly expressed in bladder wall (Table 2A), 
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showed a siimiar expressioii in single ceU solutions of bladder tumor cells 
compared to biopsies. However, the level was much lower than seen in the 
bladder waU and raised the question whether the expressed KNAs originale 
from the bladder waU or from the urotbdial cdls. 

To answer this question we cxanuned the expression of genes expected 
to be presort in bladder wail (Table 2B). Some of these genes were expressed 
in the bladder tumor samples, and probably mdicate the presence of bladder 
wall components in tiiese. It was striking tiiat the single cell solutions contained 
much lower expression levels of these genes compared to the biopsies 
(p<0.004). Altiiough the number of examined tumors was small this indicates 
that preparation of angle cell solutions may reduce tiie presence of bladder wall 
cells in the samples. The absence of keratin 8 in the bladder wall sample 
demonstrated tiiat this sample was devoid of urotiveUal ceUs (Table 2B). 

The genes known to be related to tiie bladda- wall components, showed 
a variable level tiiroughout tiie samples. Some genes like myosin light chain 2 
gene, fibroblast tropomyosin gKie and aipha-1 collagen type IV gene, were 
generally more expressed in the tumor samples than the other genes (Table 
2B). We hypotiiesize that tMs may reflect that there is a differential expression 
of genes in the connective tissue component that inevitably is included in a 
tumor bfopsy, an expression that may deviate from the one found in the bladder 
wall fimher away from the tumor. An example of this differential expression 
inbbdder wall was tiie presence of tramcript from the myosin Ught chain g«ie 
in the tumor samples but not in the bladder wall biopsies (Table 2B). 

In tumors many important events take place in the non-epithelial 
compartment Tumors need a comiective tissue support, they need blood 
vessels, they interact witii the-immune system and have interceUular signaling 
with various sorts of other ceUs in a compUcated way that has not yet been 
clarified in detail. Gene e:q)ression originating from the non-epitivelial 
compaitmoit contributes to the expression proffle of a tumor, and might be of 
great importance in relation to tiie clinical outcome and therapeutic response 
of the tumor. 
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F.YAVtPLE3 

E,^mprqfiMgqf mm pools to assess mcHvidual tumor heterogeneity. 

We expected tumors fiom the same pathology stage and grade to have 
a more similar gene expression than tumors fiom different stages and grades. 
5 TiHs was the case; however, these (Terences were not striking 

exceptions. Based on this we hypothesized that, a given tumor scored by a 
pathologist at Ught microscopy to be similar to another one may actually 
represent tumors with a remarkably different gene expression. The reason for 
this could be that tumors do not progress from a weE defined stage to another 
10 weU defined stage, but rather that a continuous change is taking place at the 
expression level - which is then only partly reflected by morphology. 

We tested this hypothesis by examining the expression which differed 
between a pool of tumors firom a given stage and a single tumor, to see 
whether the differentially expressed genes were linked to a certain stage or 
15 whether they were a random deviation. It was evident that tumor expression 
that deviates from the pool systematically have expressions belonging to either 
a lower or a higher stage, or both, or unique expressions not seen in the pool 
(Fig. 3). The low stage Ta tumor bad 43 genes whose expressions were 
increased or decreased similar to the ones seen in a Stage 2 grade IV tumor 
20 pool, and only three expressions that were altered in the opposite direction of 
stage 2 grade IV. The stage 2 grade W tumor had 33 genes whose expression 
was other increased or decreased similar to the ones seen in the Ta superficial 
tumor pool, and only 8 genes altered in the opposite direction. In the T2 grade 
IE tumor, gene expressions that were increased or decreased sunilar to the 
25 superficial or the grade IV invasive pool could be found. Furthermore some 
gene expressions were only low in tius grade m tumor, and hi^r in both 
superficial and grade IV tumor pools. The dinical information on the examined 
single tumors (Table 3) paralleled the expression findings as the mtermediate 
grade HI tumor was the first musde invasive tumor in a patiem who had had 
30 a superficial tumor five months earHer. It seems that this tumor has not reached 
the level of raaUgnancy as seen in the other invasive tumor. The latter was of 
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grade IV, and was a big solid tumor with muscle invasion at first visit. The 
superficial Ta tumor v/as the fifth recurrence and was followed by two new 
recurrences 64 and 159 days later— also of superficial nature. 

The genes that were identified in lower stage and grade tumors and 
5 shown to be similar to the expression in high stage tumors are listed in Tables 
4A and 4B. These genes may "signal" a high©: stage or grade, or represent a 
transition firom low stage or grade to high stage or grade. To aid in avoiding 
interpreting bladder wall expression as tumor cell-specific expression, the 
expression level in bladder wail is listed in Tables 4A and 4B. Two columns 

10 are shown which simulate an increase in bladder wall content to 20% and 50% 
of the sample. These columns were obtained by adding a 20% or 50% 
contribution firom the "bladder wall" column to the appropriate remaining 
percentage contribution (80% or 50%) firom the TaGrll Pool column. Single 
tumor expression level (column labeled *Ta single tumor"') was interrogated m 

15 this context. Expression levels which are unlikely to be due to bladder wall 
contamination are shown in bold; other expressions are shovm in regular font. 
We believe this procedure is usefiil and leads to reliable conclusions. 

These genes form a complex group of genes with highly different 
fimctions. It is not totally unexpected that mucin synthesis is changed, nor that 

20 cytokeratin 15 is decreased when moving fi-om Ta and to higher stage» The 

gene expressions which signal a higher grade of atypia in akeady invasive grade 
in tumors are, among others, immunology related genes. This may indicate 
that the more atypical cells are either surrounded by inflammatory cells, or that 
the tumor cells start synthesimg these protdns. Further investigation is needed 

25 to elucidate this point, and these proteins will be an interesting parameter to 
follow in relation to clinical course in the fiiture. The strong up-regulation of 
cathepsin B may indicate an increased proteolytic attack against the connective 
tissue. 

The cause of the changed expression is unknown and could be either 
30 a transcriptional regulation or secondary to gain or loss of chromosome 
material. Both mechanisms are known to occur in cancer ceils. 
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P.yAMPLE4 

Chmse of transcript level during the progression of bladder cancer. 

Biopsies from human bladder tumors were analyzed as single tumors 
or as pools of tumors representing the diferent stages in the progression of the 
bladds- cancer disease. We used a total of 5 angle tumors and 4 tumor pools, 
each pool made by combining sk tumors. To generate a nonnal reference 
material, we pooled biopsies from normal bladder mucosa from 35 volunteers. 
The biopsies were disintegrated into single cell solutions immediately after 
i^oval, filtered and snap frozen in guanidinium isothiocyanate. From the cdl 
solutions RNA was extracted, reverse transcribed to cDNA and the cDNA 
transcribed into labelled cRNA, that was incubated on the chip cartridges 
foUowed by scamiing and scaling to a global cWp intensity amounting to 1 50 
units per probe set. The scaling made it possible to compare individual 
experimeatsto each other. To verify the reproducibility, double determinations 
were made in sdected cases and showed a good correlation (Fig. 4A). 

We compared gene expresaon at three diffwent steps in the 
progression of bladder cancer to each other by the use of the normal pool as 
a reference. A scatter plot of the noninvasive pTa grade one tumor and the 
invasive highly abnormal grade four pT2+ tumor showed a minor subfraction 
of the gene transcripts to deviate much from those in a» normal urotheEum. 
The large majority of transcripts were within a narrow range in both tumors 
and nonnal urothdium (Fig. 4B.C). The raimber of deviating genes was higher 
in the most abnonnal tumor. 

We then analyzed transcripts that showed afterations larger than five- 
fold, when comparing three different pools representing the tranation from 
normal urothelium to superficial tumor, and further on to invasive transitional 
cea carcinomas (TCC). The method applied consisted in a probe-to-probe 
comparison (20 ^obes per gene) based on the software GeneChip® Analysis 
Suite 3 . 1 from A^etrix, Inc. Increased levels indicate that the transcript is 
either upregulated at the stated level or turned on de novo reaching a given fold 
above the background level. Decreased levels in a simUar way indicate 
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reduction or loss of transcript. Alterations of a single transcript during the 
progression of the bladder cancer disease can follow several different pathways 
(Fig, 6). Some of the transcript chianges reflect the transition from normal 
cells to tumor cells, and are group^l as TCC related genes (Fig. 6A, B). A 
5 distinct feature of group A was the presence of 6 smooth musde related genes. 
Others ^ altered only in superficial tumors, not in invasive toraors, and are 
grouped as Blackler papiU<mia related genes (Fig. 6Q D). Group C, with 
downregulated genes, contained 15 immunology^reiated g^s. Group D 
contained a variety ofgenes encoding proteins with different fiinctions. Finally 
10 some genes only showed an alteration in invasive tumors and are grouped as 
Invasive TCC related genes (Fig. 6E, F). The genes in group E encoded 
functionally unrelated proteins, whereas group F contained 5 immunology- 
related genes. Thus, it seemed possible to define groups of genes whose 
expression level is associated with the stage of bladder tumors. 

Cluster Analysis 

The level of a gene transaipt during disease progression can be thought 
of as a pattern that can be correlated to patterns of other gene transcripts. If 
the expression of one gene is very shnilar to the expression of another gene in 

20 several samples they are a correlated pair of genes. This pair of genes can th«i 
be condated to other gafies with a sinmlar transcriptional behavior in the set of 
tisaies examined, and together these constitute a gene cluster. In the next step 
the relation between clustm k established and a dendrogram of genes is 
formed, in which strongly correlating gene clusters are near each oth^. The 

25 prindples are describe in Eis^ et al.. Free. Nat!. Acad. ScL USA 9S, 14863 
(1998). Briefly, each gene vector was placed in its own clu^er, where the 
cluster prototype was set to the gene vector. AU pair-wise vector angles 
between cluster prototypes were calailated. The smallest vector angle was 
identified, and those clusters were m«:ged as a weighted average of the two 

30 prototype (and also a weighted average of all the gene vectors each prototype 
repressed), TheveOorai^esw^thenupdatedbetween the newly merged 
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dusters and the merger process was repeated. The final dusters are displayed 
k the order in which they were merged. 

Exactly the same procedure used to clusto: genes can be used to cluster 
the tissue samples, showing the relation between the different tissues based on 
their gene expression. We based clustering analysis on either the 4067 
transcripts being scored as present in at least one of the samples, or based on 
those 400 transcripts(see Table 9) that covaried best with a weighting scheme 
adding increaang values to increasing stages. 

The scaled AvgDif measures as calculated by the Affymetrix software 
were extracted for the normal pool and each of the graded tissues. Only the 
4067gene8 with an AbsCall of P (present) in at least one of the tissues were 
considered. AU AvgDif measures below 20 were set to 20. For each tissue 
and each gene, the AvgDif from the normal pool was either subtracted, to 
define the "absolute diffeience,''or divided and natural logarithm applied to 
define the "log-fold" relative measure. The relative expression measures for 
each tissue (log-fold or absolute difference) were used to cluster tissues by a 
hierarchical method usmg the Eucfidean distance between tissues. Tissue 
dendrograms were constructed with the PHYIJP program uang clustering 
order and distances. A weighting sdieme (see Example 6) for the seven 
observed stages and grades of cancer was" used to select 200 positively 
covarying and 200 negatively covarying genes with respert to progression. 
The same hierarchical method and a normalized EucUdean distance (vector 
angle) were used to cluster the top 400 poshively and negatively covarying 
genes for both relative expression measures. Gene dendrograms were 
constructed by the same method as for the tissue daidrograms. 
Ti ssue clusters 

Different algcmthms based on dths: fold change or absolute differences 
in transcript levels across the different samples were applied to all transcripts 
or only those covarying with a progression scale. Both methods were able to 
cluster the tissues according to the tumor's or tumor pools stage and grade of 
atypia in a meaningfiil way (Fig. 5). The two noninvasive and the two invasive 



25 



pools each clustered very closely together both using the fold change and the 
absolute difference, iiulicating a close gwetic relation between these, and 
indicating that one effect of pooling samples is a reduction of the variation in 
gene expression. The single tumor preparations showed a more varied 
distribution but still reflected the stage of the tumor. In the log-fold 
dendrograms (Fig. 5 A, C) the supsfidai tumors 335, 837 and 901 cluster close 
to the superficial pools, but the pTa grade III tumor ^1 seems closer to the 
superficial pools than the pTa grade II tumor 837. This may either be due to 
the variation in histopathologicai gradmg or due to the tumors having different 
genetic properties. The minimally invasive pTl grade in tumor 320 is 
correctly placed in between the muscle invasive and the superficial tumors, and 
the muscle invasive tumor 713 is placed veiy close to the pools of pT2+ 
tumors. Tumor 713 seemed to be closer to the pT2+grade IV than Grade lU 
pool although it was histopathologicaliy scored as grade m. In the absolute 
difference dendrogram (Kg. 5B, D) the superficial tumors 837 and 901 are 
doseiy related to tiie superficial pools, the pTl superficially invasive tumor is 
less related and finally the invasive tumor 713 located closest to the invasive 
pools. An excqjtion was the superfidal tumor pTa grade 1, 335-6 that deviated 
fixan all other tumors. Whether this tumor has unique properties is unknown, 
however it did not deviate firom the expected locatk}n in the dendrograms 
based on fold change. 

The dendrograms show that the clustering algorithms work very well, 
that the dataset obtained from the oligonucleotide arrays reflect the biological 
properties of the tumors, and that objective information on a tumor's stage and 
grade can be obtained firom matiiematical analysis of gene expression data. 
Furthermore, it is seen that when ranking based on covariance to the 
progresdon is used to extract the top 10% covarying genes, these have a 
dendrogram that is aknost identical to the one based on 4067 genes. We 
therefore used tiie ranking procedure when analyzing gene clusters. 
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r^f. filnsters 

Hw data obtained fitsm duster analyas are presented as colored images 
in v^Wch genes wth sinnkr ecpresaon patterns are clustered next to each other 
on the vertical axis and the samples according to stage and grade on the 
5 horizontal axis (Fig. 7). The color of each ceU in the tabular image represents 
the ratio between the sample expresaon of the gene in question and the 
expression in normal urothelium. The color saturation is directly proportional 
to the magnitude of the measured expression ratio, cyan indicating the lowest 
ratio, yellow indicating the highest ratio. Black imficates a ratio of one, a 
10 similar level of ecpression in tumor as in normal urothelium. The two different 
clustering methods, log-fold and absolute difference gave completely different 
clusters across the set of samples (Fig. 7). 

la the log-fold based cluster analysis, the top 200 positively covarying 
genes can be divided into five different dusters containing fiinctionally related 
15 gene8(Fig.7,leftuppercolumn). The duster shown at the top contains genes 
related to ceU proliferation such as cycling A and E, PCTAIRE-1, and 
SWI/SNF. The next cluster mainly contains oncogenes and growth factors. 
Genes in both these dusters are expressed at a levd dose to that seen in 
normal urothelium in superficial tumors (black) and increase during disease 
20 progression (yellow). The two clusters at the lower part show a reduced 
ecpreaaon level in the superfidal tumors compared to normal (cyan) and th^ 
an increase above the normal urotheUal level m invaswe tumors (shades of 
yellow). These clusters contain a set of immunologically related genes, like 
different MHC's and immunoglobulins, cancer related genes like src4ike kinase 
25 and Fas/Apo-1. and finally another immunologically related duster at the 
bottom. 

The 200 negatively covarying genes (Fig. 7, Irffc lower column) could 
be divided into three different clusters based on log-foW change and fimction 
of the genes. The upper cluster comains genes related to cell adhesion like 
30 laminins, integrins and P-cadherin (Fig. 7, left lower column). They all show 
a reduced levd of expression m the invasive tumors as evidenced by the cyan 
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coloring to the right. The small middle cluster contains four genes related to 
transcription, and finally the lowest cluster in the figure contains five 
proteinases, like cathepsm E (two difEerent probe sets for the same gene) and 
metalloprotdnase as weB as a protease inhibitor. The lower clusters are 
characterized by an increase in level in superficial tumors (yeUow) followed by 
a reduction to a level below nonnal urotheBura in invasive tumors. 

In the absolute difference based cluster analysis the top 200 covaiying 
genes that showed a positive covariancc contained only few cfaisters having a 
fianctional relation. The upper cluster (Fig. 7, right upper colunm) contained 
five genes related to cell proliferation like the raicrotubule-assodated protein 
and oncoprotdn 18/stathnnn. The next dusterwas a set of inununology related 
genes like MHC and LERK-2. Both these clusters showed an increased 
expresaon level in invasive tumors compared to normal urothelium. The cluster 
at the lower end of the figure showed a reduced level in superficial tumors and 
a return to normal or increased level in invaMve tumors. This cluster contained 
many immunology-related genes like MHC, HIA and immunoglobulin genes. 
Finally, for genes that showed a negative covariance based on absolute 
difference (Fig. 7, right lower column), this was mainly due to clustering of 
ribosomal genes. A very tight cluster in the middle of the graph show 
ribosomes that are upregulated in expression in superficial tumors and 
downregulated or unaltered in invasive tumors. The middle ribosomal duster 
is generally expressed at a lower level than in normal urothelium, whereas the 
cluster at the bottom of the figure is simUar to the one in the middle. Other 
genes that seemed to duster were a smaU tight duster of immunology related 
genes, and two wmor inWbitors, TGP'htAa. superfamily protein and Suil in the 

uppermost duster. 

Thus, a pattern of altered gaie transcription occurs during the 
progression of bladder cancer that involves a number of genes bdonging to 
functionally different gene femiiies. Cluster analyss identified many biologically 
relevant genes, and in that aspect was superior to the probe-by-probe 
comparison described above. 
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Where dp(X,Y)= '"i 



and len(X)»sqrt( 



Z7J ^ 
) 



After the sample had been compared in this way to each pool, invasive and 
non-invasve, the pool which differed from the sample by the smaEer angle was 
5 detOTninedtobethedassofthesample. 

Samples from ten bladder tumors were assigned by the classifier, and 
the results are shown in Fig. 8. The classifier was able to correctly call the 
presence or absence of muscle invasion in all ten samples examined (7 
non-rausde invasive, 3 muscle invasive), based on categorizations made by a 
10 pathologist. To cross-validate the classification methodology, two non- 
mvaave (Ta) and two invasive (T2) pools were compared wth tumors from 
each of 10 patients. Since two pods were avaUable for each stage, four 
possible combinations (classifier s^s) of one Ta pool and one T2 pool were 
tested. The dasafication rate was calculated as the number of tumors correctly 
15 identified times 10. The results for each of the four classifier sets were 

averaged to obtain the data shown m Kg. 8. 

EXAMPLE? 

Confirmation ofmicroarrc^ expression analysis by Northern blotting 

In order to confirm the array data, Northern blotting was performed on 
20 the same samples ofRNA as used for array hybridization. A standardized 

amount of ENA was run in each lane, foUowed by blotting with a labeUed SNA 

probe, and quantitation of the band obtained (Fig. 9). 

Total RNA, 0.5-4 ng per lane, was separated in 1.5% agarose- 

fonnaldehyde gels, transferred onto Zeta-Probe® nylon membrane (Bio-Rad) 
25 by poMtive pressure (Posiblotter, Stratagene) and immobilized by baking for 

20 min at 120'C. The filters were hybridized with digoxygenin- labelled (DIG) 

RNA transcribed from 600-1000 bp PCR products containing a T7 promoter 
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incorporated via the antisense primers. Filters were hybridized with 10 ng 
probe per ml of ultrahyb™ hybridisations solution at 68'C for 16 h and washed 
to a stringency of O.lx SSC at 68'C. Specific hybridization was detected by 
reacting the membrane with monoclone anti-MG antibodies conjugated with 
alkaline phosphatase, incubating with ECF cbemifluorescence substrate 
(AmershamPharmacia) and scanning on a Storm 840 (Molecular Dynamics). 
The hybridization signals were quantified with ImageQuant 5.0 software. 

As can be seen from the plots, the oUgonucleotide array and the 
Northern blot gave similar resuUs with the different probes, both in genes 
expressed at a high level (beta2-microglobulin), and those expressed at a very 
low level (CD59). 

EXAMPLES 

Immunohistochemical localization of expressed proteins 

The biopsy samples used to study gene expression in bladder tumors 
contain ceUs other than urothelial cells, although the amount of other cells 
should be limited due to the use of single ceU solutions. We therefore used 
immunostaining of tissue sections from the single tumors examined to 
detemune wWch ceUs expressed the protein encoded by the transcript in 
question. We used the transcript levels to sdect a group of proteins supposed 
to show variation ftom sample to sample, making posidble a rough correlation 
between level of ptotdn detected and Intensity of the transcript on the 
microairay. 

Four \m sections w^e cut fi-om parafBn-erabedded tissue blocks, 
mounted, and deparafiBnized by incubation at SO'C for 10 min, followed by 
immersion in heated oU at 60°C for 10 rain (Estisol 312, Estichem A/S, 
Denmark) and rehydration. Antigen retrieval was achieved in TEG (Tris- 
EDTA-GIycerol) buffer using microwaves at 900 W. The tissue sections 
cooled in the buffer for 15 min before a brief rinse in tap water. Endogenous 
peroxidase activity was blocked by incubating the sections with 1% HjO, for 
20 min, followed by three rinses in tap water, 1 min each. The sections were 
then soaked in PBS bufEer for 2 rain. The next steps were modified from the 
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descriptions given by Oncogene Sdence Inc., in the Mouse 
ImmunoMstochenustry Detection System, XHCOl (UniTect, Uniondale, NY, 
USA). Briefly, the tissue sections were incubated overnight at 4''C with 
primary antibody (agwmt beta-2 microglobulin (Dako), cytokeratin 8, 
5 cystatin-C (both from Europa,US), junB. CD59, E-cadherin, apo-E. cathepsin 
E, vimentin, IGFE (aU from Santa Cruz), Mowed by three rinses in PBS 
buffer for 5 nun each. Afterwards, the sections were incubated with 
biotinyiated secondary antibody fia- 30 rain, rinsed three times wUh PBS buffer 
and subsequently incubated with ABC (avidin-biotinlylated horseradish 
10 peroxidase complex) for 30 min, foUowed by three rinses in PBS buffer. 
Staining was performed by incubation with AEC (3-aniino-ethyloari)azole) for 
10 min. The tissue sections were counter stained with Mayers hematoxylin, 
washed in tap water for 5 rain, and mounted with glycerol-gelatin. Positive and 
negative controls were included in each staining round with all antibodies. 
15 We found several of the proteins to be expressed not only by urothelial 

cells but also by leukocytes, endodieUal ceUs or histiocytes (Table 10, Fig. 10). 
Of the examined proteins only keratm 8 and ApoE were confined to 
urothdhim; the other proteins were also present in other ceU types. Based on 
the assumption that transcript and protein originates from the same cell, this 
20 clearly indicates that conclusions on the origin of the transcripts requires a 
histological examination, or other verification procedure. The amount of 
stroma in a biopsy, the vascularization (amount of endothelial ceUs), the level 
of leukocyte infihiation, and the grade of atypia of the urothelial ceUs were all 
parameters th^ seemed to influence the level of a gjven transcript. 
25 The level of proton identified by immunostwning, disregarding the cell 

type expresang the protein, correlated well with the transcript level measured 
on the microairay (Rg. 10). However, no attempt was made to quanthate the 
immunostaining due to the often large heterogeneity in staining across the 
sections. 
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Qanbank 

AB000584_al 

AB002533_at 

AB0025S9_at 

AB003102_al 

ABOO6702_al 

AC002045_xpl2_s_at 

ACa02073_cds1_at 

AC002115_cds1_at 

AC002t15_cds4_at 

AD000092_cds1_at 

AF000562_at 

AF001359_Lat 

AF009368_at 

AF015913_at 

D00017_at 

D00408_s_at 

DQ0596_at 

O00654_at 

D10523_at 

D11086_^at 

D11088_al 

D11094_at 

D11327_s_at 

D13118_at 

D13413_ma1_s_at 

D13643_^ 

D13705_s_at 

D13748_at 

D14043_at 

D14530_at 

D14710_at 

D1Q294_at 

D18S62_at 

D16581.at 

D17516_at 

D17516_at 

D17525_al 

D21083_at 

D21281_at 

D236e0_at 

D25218_at 

D25218_al 

D25248^at 

D25248_at 

D25278_at 

D25303_at 

D2B129_at 

D28528_at 

D^535_s_at 

D26599_at 

D28383.at 

D28589_at 

D28915_at 

D29012_at 



Gana ID 

AB00Q584 

KPMA4 

AB002559 

AB003102 

AB0067e2 

AC002045 

AC002073 

AC002115 

AC002115 

CH19HHR23 

AF000562 

AF001359 

AFOoases 

SKB1 

HUMLIC 

HUMXYPFLA 

HUMTS1 

HUMACTSG7 

HUM2DGDH 

HUM1L2RG 

HUM1L2RG 

HUMMSS1 

HUMLCPTP 

HUMATPSCP1 

HUMTA120 

HUMRSC390 

HUMOMHY 

HUM4AI 

HUMMGC24 

HUMRSPT 

HUMIPASAS2 

HUMDSAEC 

HUMATPSGL 

HUM80DGTP 

HUMPACAPR 

HUMPACAPR 

D17525 

HUMORFAAA 

HUMORFFA 

HUMRSP 

HUMORFN 

HUMORFN 

HUMRES44 

HUMRES44 

HUMORFO 

HUMIAS 

HUMRNASA 

HUMRNA 

HUiyiDS 

HUMPSH2 

HUh/lASB42 

HUMKG1E 

HUMHGAft^PS 

HUMPSY 



Oana nama 

TGF-bata auperfamily protaln. 
Qipi. ckaiYopherin alpha 4 

26S proteasome subunrt p44.5, 
8alecUn-9 isoform, 
Chromosome 16 BAG clone CIT987 
PAC done DJ515N1 from 22qt1,2 
DMA from overlapping chromosom 
DMA from overlapping chromosom 
DMA from chromosome I9p13.2 co 
uroplakln H mRNA, 
DMA mismatch repair protein M 
Luman mRNA, :Luman "mRNA," / 
SKBIHsmRNA. :skb1 \(S. pomt)e 
lipocoitin II, :annexlnll 
fetal liver cytochrome P-450 
gene for thymfdylate synthase, 
gene for enteric smooth muscle 
2-oxoglutarate dehydrogenase, 
interleukin 2 receptor gamma 
interleuKin 2 receptor gamma 
MSS1, :proteasome Mprosome, 
protein-tyroslne phosphatase, 
PI ATP synthase subunrt c. : 
lumor-assodated 120 KDa nud 
KIAAQ018 gene, :KIAA00t8 gen 
fatty adds omega-hydroxylase 
eukaryotlc Irwtlation factor 
MGC-24. : "MGC-24," complete 
homolog of yeast ribosomal pro 
ATP synthase alpha subunit, 
mttochondriai 3-oxoacyl-CoAt 
ATP synthase gamma-subunlt \{ 
8^xo-dGTPase, :"8^o-dGTP 
PACAP receptor, radenylatec 
PACAP receptor, zadanylatec 
precursor of PI 00 serine prot 
KIAA0030 gene, 
KlAA0120gene, nransgelin2 
rlb<^mal protein, rribosom 
KlAA0112gene» 
KlAA0112genB, 
mRNA, done:RES4-4. 
mRNA, done:RES4^. 
KIAA0036 gene, :KlAAQ036gen 
integrin alpha subunlt, 
rfbonudease A \(RNase A\), 
RHAhellcase, :DEAD/H MAsP- 
gene for dihydrollpoamWe succ 
proteasome subunft HsC?"!, : 
ATP syn^ase B chain, 5'UTH 
mRNA \{KJAA0016T^, partial se 
gene for hepatitis C-assodate 
proteasome siibunilY, :pn5t 



covariance 
av dlf neg 
av dif neg 
av dlf neg 
av dlf neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dif neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dif neg 
av dlf neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dif neg 
av dif neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dlf neg 
av djfneg 
av dif neg 
av dif neg 
av dlf neg 
av dlf neg 
av dlf neg 
av dif neg 
av dif neg 
av dif neg 
av dlf neg 
av dlf neg 
av dlf neg 
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HUMORFA02 


D2fidSfi at 


HUMORFA10 


nsGASfi at 


HUMELP4AII 




HUMORFKG1C 


L/O 1 f 0*r^al 


HUMORFKG1C 


n^iAR'^ of 


HUMQRFKG1L 




HUMORFKG 1 M 




HUMORPKG1M 
owfViwr\f^r>w i ivi 








HUMHLAAD 


WiipOirt f 


HUMPSP31 




HUMORF0D8 


□36583 at 


HUMCOLO 


042046 at 


HUMKIAAJ 


D42046 at 


HUMKIAAJ 


04^7_at 


HUMKIAAK 


D43682]^s_at 


HUMVLCAD 


D45370_at 


HUMUPST1 


D49396lat 


HUMA0P1 


D494a8"'at 


HUMHTTP 


D49728 at 

WtPC I 111° 


HUMNAK1 


D49824 s at 


HUMHLABAA 


D50B40 at 


D50625S16 


C^3478 at 


KIAA0144 


n83479 s at 


D63479 


D83485 at 


KIAA0151 


nS!^86 at 


K1AA0152 




D63851 


D78129 at 


HUMHL1115B 


078275 at 


PSMC8 


D79205 at 


D79205 


D79984 a at 


D79984 


D79984 s at 


079984 


D80002_at 


O8D002 


D82345_at 


D82345 


D86425_at 


086425 


D8697*_al 


086974 


D86985_at 


KIAA0232 


D87258_at 


087258 


D87735_at 


RPL14 


D87953_at 


087953 


D89052_at 


ATP6F 



HG1034-HT1(mj_8t <8nipty> 
HG14Qa4fri4(K)„s„at <empty> 
HG1428"HT142S_s_al HG1428-HT1428 
HG1428-HT1428 s^at HG1428-HTt428 
HG1 51544X151 5_Lat HG1515-HT1515 
HG151^HT1515 f at HG1515-HT1S15 
HG1614-HT1614.at HG1814-HT1614 
HG1614-HT1614„at HG1614.HT1614 
HG1800-HT1823_8t <empty> 
HG1872-HT1W_at <8mpty> 
HG1872-HT1907_at <8mpty> 
HG1 98CmT2023_at <emply> 
HG2147-HT2217_r.at <empty> 



KIAA0052 gene, avdlfneg 

KlAA0116gene, avdifneg 

eukaryotic initiation factor av dif neg 

KIAA0064 gene, :KIAA0064gen avdifneg 

KIAA0064 gene, :KIAA0D64g8n avdifneg 

KIAA0059 gene, : KIAA0059 «*g av dIf neg 

KIAAOOeS gene. :KIAA0063gen avdifneg 

KIAA0063g8ne, :KIAA0063gen avdifneg 

KiAA0067 gene. :SET domain, avdifneg 

HIA dass-l \{HIA-A26\) heavy av dlf neg 

26S proteasome subunit p31 , av dlf neg 

KIAA0079 gene. :Sec24p,S,C avdifneg 

calgizzarin, : "caiglzzarin, avdifneg 

KIAA0083 gene, :DNA2\{DNAr avdifneg 

K1AA0083 gene» :0NA2\(0NAr avdifneg 

KIAAOOaegene, ;KlAA0089"g avdifneg 

very-tong-chain acyl-CoA dehy av dif neg 

apM2 GS2374 Munknown product av dif neg 

Apo1_\(MER5XAop1''Mouse>^nk av dlf neg 

alpha4ocopherol transfer pro av dif neg 

NAK1 ONA binding protein, avdifneg 

HIA-B null allele mRNA. :HLA-8 av dif neg 

DlsiA for phosphodie^erase 3B, av dlf neg 

KlAA0144gene, :KlAA0144gen avdifneg 

K1AA0145 gene, avdifneg 

KIAA0151 gene, :KIAA01 51 gen av dlf neg 

K1AA0152 gene, :KlAA0152gen avdifneg 

uno-18 homologue, av dif neg 

squalene epoxkJase, av dif neg 

ppoteasome subunit p42, :pro avdifneg 

rlbo^mal protein L39, av dif neg 

KIAA0162gene. avdifneg 

KIAA0162 gene. sv dif neg 

KIAA0180gen8. avdifneg 

NS thymosin beta, av dif neg 

o^eonkSogen. av dif neg 

KiAA0220gene, :^K1AA0220^ avdifneg 

KIAAQ232gen6, :KIAA0232g6n avdifneg 

serin protease with IGF-blndi av dif neg 

ribosonral protein L1 4, :r1bo avdif neg 

RTP, : mP," complete cds avdifneg 

proton-ATPase-like protein, av dlf neg 
<emply> 

<empty> av dif neg 

i^'^GIobln,^ Beta** av dlf neg 

;*^GIoblnr Beta" av dif neg 

:Transcriptlon Factor Btf3b av dJf neg 

•.Transcdption Factor BtfSb av dif neg 

•Protein Phosphatase "1 ,° Alp av dif neg 

:Prmein Phosphatase ; Alp av dif neg 
<empty> 

<empty> av dlf neg 

<empty> av d'rf neg 

<8mpty> sv dif neg 

<empty> avdifneg 
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HG21474fr2217_L.at <empty> 
HG2149-HT2219_at <8mpty> 
HG2167-HT2237„at <8mpty> 
HG2197-HT2267^5^at HG2197^HT2257 
HG2238-KT2321^5.at HG223S.HT2321 
HG2239-HT2324_L.at <empty> 
HG2239-HT2324_r_at <empty> 
HG2264-HT2360_al <empty> 
HG2279-HT2375.at HG2279.HT2375 
HG2566-HT4S67 at HG2566-HT4867 
HG278S-HT289e_at HG2788-HT2896 
HG281 S-HT2931_at <empty> 
HG2Bl5-HT2931_s_at <empty> 
H62ai5-HT4023_s_at <8mply> 
HG2873^HT3017_at <empty> 
HG2917-HT3061J^at HG2917-HT3061 
HG2917-HT3061J^at HG2917-HT3061 
HG2981-HT3127_s_at <empty> 
HG2994-HT4S50_s„at <8mpty> 
HG3039-HT3200_at <empty> 
HG3076-HT3238_S„at HG3076-HT323B 
HG3107-HT3283_s_at <Bmpty> 
HG3107-HT3283.s_at <empty> 
HG311-HT31 l.at <empty> 
HG3214-HT3391_at <empty> 
HG3236-HT3413Xat <empty> 
HG3254-HT3431_at <em{^y> 
HG3342-'HT3519 s at HG3342-HT3S19 
HG3364-HT3541^a7 HG3384-HT3S41 
HG33-HT33„at HG33-HT33 
HG3484-HrF»78js„at <:8mpty> 
HG3514^T3708 at HG3514-HT3708 
HG35434HT3739_at HG3543.HT3739 
HG3543.HT3739„at HG3543-HT3739 
HG3549-HT3751_at HG3549-HT3751 
HG3S7(W^773^at HG3570-HT3773 
HG3576-HT3779Xat <emptf> 
HG3578-HT3779J„at <emp^ 
HG3731-HT40Q1_at <©mpty> 
HG384-HT384„^ HG384-HT384 
HG384-HT3a4„al HG384-HT384 
HG3945-HT4215_at <empty> 
HG3991-HT4261_L.at <empty> 
HG4020-HT4290_s_at HG4020.HT4290 
HG4258-HT4528_aT <emply> 
HG4319-HT4S89„8t HG431 9-HT4589 
HG4338-HT4a06_at <empty> 
HG4533-HT4938_at <empty> 
HG4542-HT4947„at HG4542-HT4947 
HG4557-HT4962_r„at <empty> 
HG466WiT5083.S_at <8mpty> 
HG4668-KT5083_s_at <emply> 
HG4749'HT5197_at <8mpty> 
HG613-HT613^at HG613-HT613 
HG61 3-HTai 3_at HG813-HT81 3 



<8m?^y> 
<emj^y> 
<empty> 

:--Conage/''Type"^»,"" 

f Nuclaar umio Apparatus P 

<8mpty> 

<emply> 

<8mply> 

:Trtosef^osphate teomerase 

•.yicrotutaile-Associated Prote 

:Calcyclin 

<wipty> 

<enipty> 

<empty> 

<Bmpty> 

fUBjov Histocompatibility 

f Major Histocompatibility 

<empty> 

<0mpty> 

<enipty> 

:"Het8rogeneous Nuclear Ribon 

<0mpty> 

<empty> 

<empty> 

<8mr^y> 
<empty> 
:W1 

:R|bosomai Protein L37 

<empty> 

<empty> 

;Tropomyosin TmSOnm,^ Cytosk 
ilnsuUn-Lifee Growth Factor 2 
rlnsutln-Uke GrowUi Factor 2 
-mim'S Tumor-Related Protein 
:Protein Phosphatase inhiblto 
<empty> 
<empty> 
<eini:^ 

:Ribosomal Protein 128 
•.Ribosomal Protein 1^6 
<empty> 
<8mp^ 

:Transglutaminase 
<empty> 

:Rlbosomal Protein L5 
<empty> 

:R|bosomat Protein L10 
<8ini:^> 
<8mpty> 
<emptp 

<8mpty> 

;RlbDsomal Protein S12 
:Rlbosomal Protein S12 



av dif neg 
av dif neg 

av dif neg 
av dif neg 
av dif neg 
avdif neg 
av dif neg 
av dif neg 
av dif neg 
avdif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
avdif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif ne^ 
av dif neg 
avdif neg 
av dif ne^ 
av dif neg 
av dif neg 
av dif neg 
av <!Kf neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
avcfif neg 
avdif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif neg 
av dif nsg 
av dif neg 
av dif neg 
av dif nag 
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HG821-HT821_at 
HG880-HTBM_at 

HG9874fr987„at 

J00105_s_at 

J02811_at 

J02811_at 

J02683.s_at 

J02783_at 

J02874_al 

J02874„at 

J02902_Bt 

J028C»„at 

J03077_s_at 

J03242_s_at 

J03242_s_at 

J03692^at 

J03758_at 

J03801J_at 

J03909^at 

J03g09_at 

J03934_S„at 

J04093_s_at 

J04093_s_at 

J04152_ma1_s_at 

J04164_al 

JD4164„at 

J04173_,at 

J04811_at 

J04815_at 

J04617_s_at 

JCK973_^ 

J05036„s_at 

J05272Lat 

K02405J_at 

K03189J_at 

K03430_at 

K03430_at 

LQ0634_s_at 

L02326J_al 

L0427a_at 

L04483_s_at 

L04490_at 

L05072_s_at 

L05188J_at 

U»499_et 

L06797_s_at 

LO0797_s.at 

UJ7044_al 

L08868_at 

L09209_$_at 

Lt0413_at 

L11568_at 



<0mpty> 

<empty> 

<empty> 

HG987-HT887 

HSMGLO 

HUMAPOD 

HUMAPOD 

HUMATPC 

HUMTHBP 

HUMALBP 

HUMALBP 

HUMP2A 

HUMCYPIIF 

HUMGLBA 

HUMGFIL2 

HUMGFIL2 

HUMTLCA 

HUMGHVA 

HUMLS2 

HUMHP 

HUMllP 

HUMNMOR 

HUMUGT1FA 

HUMUGT1FA 

HUMGA733A 

HUM927A 

HUM927A 

HUMPGAM 

HUMANP70 

HUMSNRAA 

HUMEFIA 

HUMC0R2M 

HUMCTSE 

HUMCTSE 

HUNIIMPH 

HUMMHDC3B 

HUf^CGBELOS 

HUMC1QB2 

HUMC1QB2 

HUMFPTA 

HUMPREBLYM 

HUMTNFRFIP 

HUMRPS21X 

HUMNADH 

HUMIFNRF1A 

HUMSPRR2B 

HUMRPL37A 

HUML12A 

HUMGPCR 

HUMGPCR 

HUMCCDPKB 

HUMPORIN 

HUMAMYLOID 

HUMFTA 

HUMRPL18A 



<empty> avdifneg 

<empty> dif neg 

<eiiipty> avdifneg 

:Mac25:Mac25 avdifneg 

messenger RNA fragment for the av dif neg 

apollpoprolein D mRNA, :apoii avdifneg 

apollpoproteln D mRNA, :apol{ avdifneg 

ADF/ATP csaiTler protein mRNA. av dif neg 

thyroid hormone binding protef av dif neg 

adipocyte lipld-binding pnstel av dif neg 

adipocyte Jipjd-binding protei av dif neg 

protein phosphatase 2A regulat av dif neg 

cytt«:hrome P450liF1 protein \( av dif neg 

co-beta glucosldase \(proactiv av dif neg 

insulin-Ike growth factor 11 m av dif neg 

tnsulin-lke growth factor II m av dif neg 

ADP/ATP translocase mRNA. 3' e av dif neg 

growth honmone-variant MGHi \) av dif neg 

tysozyme mRNA. complete cds wi av dif neg 

gamma-interferon-inducibJe pro av dif neg 

gamma-lnterferon-lnduciWe pro av dif neg 

Human, NAD\(R^)H:menadjone oxi av dif neg 

phenol UDP-glucuronosyltransfe av dif neg 

phenol UDP-glucuronosyltransfe av dif neg 

gaslfolntestinal tumor-associa av dif neg 

interferon-Inducible protein 9 av dif neg 

interferon-inducible protein 9 av dif neg 

phosphoglycerate mutase \(PGAM av dif neg 

iupus p70 \(^^^ autoantlgen p av dif neg 

lupus autoantlgen Hsmall nud av dif neg 

elongation factor EF-1-alpha g av dif neg 

cytochrome fcK>l complex core p av dif neg 

cathepsin E mRNA, :cathepsin avdifneg 

cathepslnEmRNA. rcathepsin avdifneg 

IMP dehydrogenase t^ 1 mRNA av dif neg 

MHC dass ii HLA-DC-^beta gen av dif neg 

chorionic gonadotropin beta su av dif pos 

complement Clq B-chain gene, e av dif pos 

complement Clq &-chaln gene, e av dif pos 

famesyt-protein transferase a av dif pos 

Xdone Hu Iamlxla-i7\) lamMa- av dif pos 

Xdone CD18V tumor necrosis av dif |x>5 

ribosomal protein S21 \(RPS21 av dif pos 

Xdone CC6^ NADH-ubfcjUlnone av dif pos 

Interferon regulatory factor 1 av dif pos 

small proline-rich protein 2 av dif pos 

ribosomal protein L37a \(RPL37 av dif pos 

ritx)Somal protein LI 2 mRNA, : av dif pos 

Xclone L5^ orphan G jroteln- av dif pos 

\(done L5^) orphan G protein- av dif pos 

caidum/calmodulln-dependent p av dif pos 

porin \lpc^ mRNA, complete c av dif pos 

amyloid protein homologue mRNA av dif pos 

famesyitransferase alpha-subu av dif pos 

ribosomal protein L18 \(RPL18 av dif pos 
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L11672„at 

L11708_at 

L11708„8t 

L12711«.s_at 

L12711_s.at 

L19493_S_8t 

L19527_at 

L19688_rna1_8t 

L19688_ma1_at 

L19779_at 

L20941_at 

L21954^al 

L21954„at 

L26247_at 

U7943_at 

L32866_at 

U2976_at 

L33075_at 

L33243_at 

L33842_ma1_at 

L33842_ma1_at 

L33930_s_at 

L37127^at 

L38490_s_at 

L3a928_at 

L38941.al 

L39059-at 

L40357_at 

U0379_at 

U0387_at 

L40392_at 

L40904_at 

L41870_al 

W2176^al 

L42373_at 

L42542^at 

L78159_at 

L76465_at 

L778W_at 

M10612_st 

M11119_at 

M11147_at 

M11313_s„at 

M11353.at 

M12529_at 

M12529_at 

M12888_^ 

W2888_at 

yi3207_at 

M13560_S_al 

M1366e_at 

M13755_Bt 

M13829_s„at 

M13829_s_at 



HUR^RUPZN 

HUMB17HSD 

HUMB17HSD 

HUMTRANSKE 

HUMTRANSKE 

HUMFMR1R 

HUMRPL27 

HUMMIF 

HUMMIF 

HUMH2A2A 

HUMLYGDI 

HUMFERRITH 

KSFBR4 

HSPBR4 

HUMSUIISO 

HUMCYDE 

HUMEPR1NP 

HUMMLK3A 

HUMIQGA 

HUMPKD1A 

HUMIMPDH 

HUMIMPDH 

HUMCD24B 

HUMRPIA 

HUKAADPRF 

HUMMETSYN 

HUMRPL34A 

HUMfTFSLIA 

HUMTRIP7M 

HUMTRIP10M 

HUMTRIP14G 

HUMORFB 

HUMPPARGB 

HUMRB1MRNA 

HUMORAL 

HUMPP2A 

HUMRIP1R 

HUMFRG1R 

HUMPGDHB 

HUMPTPC 

HUR/lAPOCIl 

HUMERRNA 

HUMFERL 

HUMA2M 

HUMH1SH3C 

HUMAPOE 

HUMiftPOE 

HUMTC8YY 

HUMTCBYY 

HUMCSFGMA 

HUMIAIG6 

HUMCMYBB 

HUMIFNISK 

HUMPKS 

HUMPKS 



Kruppei related zinc finger pr 
17 beta hydroxysteR)id dehydro 
17 beta hydroxyslerold dehydro 
tran5tetolas8\(tK\)mRNA, f 

tmnsk8to^ase^(^^^)^nRNA' 
FMR1 gene, S'end. 
rib(^omal protein U7 \(RPLZ7 
macrophage migration Inhibitor 
macmp^iage migration inhibitor 
histone H2A,2mRNA, :H2Ahist 
GDP^issociation inhibitor pro 
femlin heavy chain mRNA, :f 
peripheral benzodiazepine race 
peripheral benzodiazepine race 
suilisot mRNA, 

cytldlne deaminase \(CDAV niRN 
effector ceil protease recef^o 
prxytein kinase \(MLK-3^) mRNA, 
raa GTPase-activating-like pro 
polycystic kidney disease 1 pr 
\{done FFE-7^) type II inosin 
\(done FFE'7^ type li inosin 
CD24 signai transducer mRI^, c 
RNA polymerase 11 mRNA, :poiy 
ADP-ritx>sylatJon factor mRNA, 
5,10-methenyltetrahydrofoiate 
ribosomai protein L34 \(RPL34 
transcription factor SL1 mRNA, 
thymW receptor Interactor \( 
thyroKS receptor interactor \( 
thymy receptor interactor \( 
V{ctone S164\) mRNA. 3' end of 
H. sapiens pemxisome prolifer 
retinoblastoma susceptibility 
\(clone 35,3\) ORAL mRNA, :\( 
phosphatase 2A B56-alpha \(FP2 
RLIP76 pnrtem mRNA. :RLIP76 
FRGImRNA. :FSHD region gene 
NA£>M3ependent 15 hydroxyprost 
pmtein tyrtjsine phosphatase m 
apoli(K}proteln Oil gene, 
endogenous retfoviius envelope 
fenltin L chain mRNA, :feni 
aipha-2-macroglobulfn mRNA, : 
H3.3 histone dass C mRNA, 
apoUpoprotein E mRNA, :apoli 
apollpKJprotein E mRNA. :apoli 
T-ceil receptor active beta-ch 
T-cell receptor active beta-ch 
granulocyte-macrophage coiony- 
la-assodated Invariant gamma- 
o-myb mRNA, 3' end. 
Interferon-induced 17'KDa/1S^k 
putative raf related protein 
putative raf related protein 



avdif pos 
avdif pos 
avdif pos 
avdif pos 
avdif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
avdif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif FX}s 
av dif tx)S 
av dif pos 
avdifpc^ 
av dif pos 
av dif FK>s 
av dif pos 
avdif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
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M13903„at 

M13929,s_at 

M13929_8_at 

M13934_ccls^rt 

M13955_Bt 

M14199_s_at 

M14199_s_at 

M14328_s_at 

M14483_ma1_s_al 

M14876„at 

M14876_at 

M1539S_at 

M15661_at 

M15661_at 

M16038_at 

M17733_at 

M17883js_at 

M17863_s_at 

M178B5_at 

M17886_at 

M18000_at 

M18737_mal_al 

M1904SJ„at 

M19159_at 

M19159.at 

M19301_at 

M19878_s_at 

M2090^at 

M20902_£^ 

M21142_cds2_s_at 

M21142_K3s2_s_at 

M2118slat 

M21186_at 

M21302_at 

M21984_al 

M22490.at 

M22960_at 

M23178_s_at 

M23613_at 

M24194_ert 

M24194_« 

M244e5_s„at 

M24488_s_at 

i«5079.s_at 

M25079_s_Ht 

M25280.at 

M28311_s_Bt 

5*2631 1_s_at 

M26665j5„at 

M26708_s_at 

M»730_s_at 

M27281_at 

M27749_r_at 

M27749_r_at 

M27826_at 



HUMINV2 

HUMMYCPOA 

HUMMYCPOA 

HUMRPS14 

HUMKERMII 

HUMLAMR 

HUMIAMR 

HUMENOA 

HUMTKYMAA 

HUMSLK 

HUMSLK 

HUMLAP 

HUMRPaCI 

HUMRPZH21 

HUMLYN 

HUMTHYB4 

HUMFFI2B 

HUMFFI2B 

HUMPPARPO 

HUl«PPARP1 

HUMWS17A 

HUMHFSP 

HUMLSZH 

HU^^ALPPD 

HUMALPPD 

HUMKAD 

HUMCALB01 

HUMAPOCIA 

HUMAPOCIA 

HUMGNAS8 

HUMGNA86 

HUMNCBLCA 

HUMNCBLCA 

HUM8PR2B 

HUMTRT 

HUMBMP2B 

HUMPPR 

HUMG0S19A 

HUMNf^ 

HUMMHBA123 

HUMMH8A123 

HUMGSTP1G 

HUMPYHBA5A 

HUMBETGLA 

HUMBETGLA 

HUMLWHR 

HUMCFA 

HUMCFA 

HUMHtS2X 

HUWtPTAA 

HUMQBPCB 

HUMVPF 

HUMIGLR141 

HUMIGLR141 

HUMRTVLH3 



invotucrin gene, exon 2. :invo av dif pos 

o-myo-P84 mRNA, inHlallno fro av dif pos 

o-myo-P84 mRNA, initiating fro av dif pos 

ribosomai protein S1 4 gene, : av dif pos 

mesothelial keratin K7 Mtype av dif pos 

lamlnin receptor \^H5 ep'itope av dif pos 

iaminin receptor \(2H5 epitope av dif pos 

alja« enolase mRNA. -.enolase av dif pos 

pwfthymosln alpha mRNA, av dif pos 

src-«keldnasB\(sll«S)mRNA, av dif pos 

sfC-llKe WnasB \(sll(\) mRNA, av dif pos 

leukocyte adhesion protein ML av dif pos 

rtbosomai protein mRNA, :rit>o av dif pos 

ribosomai protein mRNA. :ribo av dif pos 

lyn mRNA encoding a tyrosine k av dif pos 

thymosin beta^ mRNA, nhymos av dif pos 

ptBproinsuiin-iike growth fact av dif pos 

preproinsuiin-ilke growth fact . av dif pos 

acidic ribosomai phosphoprotei av dif pos 

acidic ribosomai phosphoprotei av dif pos 

rtbosomai protein S1 7 gene, : av dif pos 

Hanukah factor serine protease av dif pos 

iysorymemRNA. -."lysozyme ""m av dif pos 

placental heat-stable alkaline av dif pos 

pdacentai heat-stabie alkaline av dif pos 

branchejWhain aJpha-keto acid av dif pos 

caibindin 27 gene, exons 1 and av dif pos 

apollpoprotein C-l \(VLDL\) ge av dif pos 

apolipoprotein c-l \0/LJDL\) ge av dif pos 

guanine nucJeotkJe-binding pro av dif pos 

guanine nucleotWe-binding pro av dif pos 

neutrophil cytoc^irome b light av dif pos 

neutropha cytochrome b light av dif pos 

sn»il proline rich protein \{s av dif pos 

\(clonePWHToT16\)steletalmu av dif pos 

tone motphogenetlo proteln-2B av dif pos 

protective protein mFUi^, :pro av dif pos 

homologue-l of gene encoding a av dif pos 

nudeoj^min mRNA, :nucieoph av dif pos 

MHC proton homologous to chic av dif pos 

miC protein homologous to chic av dif pos 

ydone pHGST-pl\) glutathione av dif pos 

prolyl 4-hydroxyiase alpha sub av dif pos 

sidUe cell beta-gfobin mRNA, av dif pos 

sidUe ceil beta-gtoWn mRNA, av dif pos 

lymf^ node homing receptor mRN av dif pos 

i^sUc fibrosis antigen mRNA, av dif pos 

c^o fibrosis antigen niRNA, av dif pos 

hlsta«r»2\(HIS29mRNA. :hi av dif pos 

prothymosin alpha mRNA MProT- av dif pos 

mitochondrial ubk^ulnone-bindi av dif pos 

vascular permeability fador m av dif pos 

immunoglobulin-related 14.1 pr av dif pos 

immunoglobulin-related 14.1 pr av dif pos 

endogenous retroviral protease av dif pos 
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M27891_at 
IA2S212_al 
M28882_s_at 

M29335^at 

M2933S_at 

M29810_s_at 

M30818_at 

M3C»38_at 

M31303_ma1_at 

M31303_ma1^al 

M31520_at 

M31520_ma1^s^at 

W31627.at 

y31994_at 

M32053_at 

M32304_s_al 

M32405.at 

M32886_al 

M33600J_at 

M33680_at 

M33884_s_at 

M34041_al 

M34182_at 

M34518_al 

M34516_r_at 

M34518_L.8t 

M34715_at 

M34996„s_at 

y34996_s_at 

M3519a_at 

M35252_at 

^35878_at 

M38072_at 

M37238_s_at 

M37245_at 

f^37245_at 

M37435_al 

M37583_at 

M37815_cds1_at 

f^38448_5_at 

iyi35890_^ 

M38690_^ 

M54995_at 

M5640&_5.at 

M55409_s_at 

M57293.at 

M57399_at 

M57399„at 

M57466_s_at 

MS7468_s_at 

M57710_at 

M58378_cd5l_at 

|gl58525„s_at 

M58525_s_at 



HUMCYS3A3 

HUMRAB6A 

HUMMUC18B 

HUMMUC18B 

HUMMHDOA 

HUMMHDOA 

HUMGLYE 

HUMMXB 

HUMKUP 

HUMOP18A 

HUM0P18A 

HUMRPS24A 

HUMRPS24A 

HUMHXBP1 

HUMALDC13 

HUMH19 

HUMMET 

HUMRIGA 

HUMSRICPA 

HUMMHDRAIC 

HUMTAPA1 

HUMPPPB1A5 

HUMADRA2RA 

HUMPRKACG 

HUM1GL122 

HUM1GL122 

HUMIGL122 

HUMPSBGAA 

HUMDQAIA 

HUyOQAIA 

HUMIHTB8A 

HUMCOOTAA 

HUM1BP3 

HUyRPL7A 

HUMPLC 

HUMIGCTL3 

HUMIGCTL3 

HUMCSDF1 

HUi^HlS2AZ 

HUMCD284 

HUMTGFBA 

HUMANTCD9 

HUyANTCD9 

HUWCTAPS 

hWMPANCAN 

HUMPANCAN 

HUMPTHRPA 

HUMHBNFi ' 

HUMHBNF1 

HUMMHDPL 

HUMMHDPL 

HUMBPIGE 

HUMSYN1E13 

HUMCOMTC 

HUMCOMTC 



cyslatin C \(CST3\) gone, axon 
GTP-Unding protein \(RAB6N) m 
MUCt8 glycoprotein mRNA, i^^^el 
MUC1 8 glycoprotein nnRNA, :mel 
MHO cJass II DO-alpha mRNA. 
MHC ciass I) DOalpha mRNA, 
gtycophorin E mRNA. :glycopho 
Interferon-lnduced celluiar re 
Ku \(p70/p80\) aubunit mRNAt 
oncoprotein 18 H0p18\) gene, 
oncoprotein 18 \(0p18^ gene, 
ribosomal protein S24 mRNA. 
tibosomal protein S24 mRNA. :r 
X t>ox binding protein-1 MXBP- 
aldehyde dehydrogenase \(ALDH1 
H19 F«4A gene, 
metalloproteinase Inhibitor mR 
homoiogue of rat insulinoma ge. 
sorcln CP-22 mRNA, isordnis 
MHO class II HLA-DR-beia-1 \(H 
28-lcDa cett surface pioteln TA 
\(done lambda-16-1\) non-rece 
alpha-2-8dfenergic receptor M 
testls-specafic protein kinase 
omega light chain protein 14,1 
omega light chain protein 14,1 
omega light chain protein 14.1 
piBgnancy-speclfJC beta-1-giyc 
MHC cell surface glycoprotein 
MHC cett surface glycoprotein 
Integrin B-8 mRNA, rintegrin, 
CO-029. rtransmembrane 4 super 
InsuUn-Uke growth facSor-bin 
ribosomal protein L7a \{surf 3 
phosphollpase C mRNA. :phosph 
Ig superfamlly cytotoxic T-lym 
Ig superfamUy cytotoxic T-lym 
macrophage-speclfic colony-sti 
histone \(H2A.Z?0 mRNA, :hlst 
T-cell membrane glycoprx}tein C 
transfofming growth fador-bet 
CD9 antigen mRNA. ;CD9 antlge 
CD9 sntHjen mRNA, :CD9antigB 
connective tissue activation p 
pancreatic tumor-related prote 
pancreatic tumor-related frote 
parathyroid hoTmone-related pe 
nejve growth factor \(HBNF«1\) 
nerve growth factor \{HBNF«1>) 
mo dass 11 HLA-DP light chal 
MHC class il HlJ^-DP light chai 
IgE-Wnding protein \(epsilon- 
synapsln i XSYNIX) gene, exon 
catechoK^methyttransferase 
catechoJ-Omethyltransferase 



av dif pos 
av dif pos 
av d'lf pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dff pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
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M6»216_s_at 

M59371^at 

M59807^8t 

M59830_at 

MSMII^at 

M60483_mal_s_at 

M60854_at 

M519l6_at 

M«403_s_at 

M62403_s_at 

M624«8_at 

M63256_at 

M63379_at 

Me3438_5_at 

Wi63438_3_at 

M03573_at 

M635«9„at 

M64347_at 

M84347_at 

MB4673_at 

M64718_al 

M6S292_5_at 

MS5292_s^at 

?iil69023^at 

Mi89066_at 

M69238^at 

M73077_at 

M73239„s_st 

M73547_at 

M74093_at 

M74297_at 

M7471S_$„at 

M77232„ma1_at 

y77836„at 

M80244_at 

M80254_at 

M80359_at 

M80563jat 

M8Q5e3_al 

M80899_at 

M81750_at 

M81757_at 

M81883_at 

M83t81_at 

W84424_St 

M84711_at 

M85289_at 

M88400_at 

M88699_al 

M86737_at 

M87789_s_at 

WI87789_s_at 

Ii/I90356j_at 

M90658„at 



UKAGABRB1S5 

HUhAECK 

HUMNK4 

HUMMHHSP2 

HUMINTA3A 

HUMPP2AA 

HUMSRAA 

HUMLAMiOl 

HUM1GFBP5 

HUMIGFBP5 

UMPRPC4S12 

HUMCDR2AA 

HUMTRPM2A4 

HUMIGGK 

HUMIGGK 

HUMSCYLP 

HUMSCL7 

HUMFGFLR 

HUMFGFLR 

HUMHSF1 

HUMRPS25 

HUMPROS30 

HUMHAAA 

HUMHAAA 

HUMGGEFERA 

HUMMOESIN 

HUMARMTA 

HUMGRF1A 

HUMSCFA1 

HUMPOLIA 

HUMCLNC 

HUMHOX14 

HUWIDNAL 

HU«RPS6B 

HUMP5CR 

HUME16GEN 

HUMCYP 

HUMP78A 

HUMGAPL 

HUMCAPL 

HUlWaiNAKA 

HUMMCNDA 

HUMS19RP 

HUMGAD57A 

HUMHTRB 

HUMCTSE09 

HUMFTE1A 

HUMHSPG2B 

HUMPHPLA2 

HUMTTK 

HUMHMGBP 

HUMIGHEPAH 

HUMIGHEPAH 

HUWiSTFO 

HUMGCSH 



gamma-aminobuiyric acid-A \(GA 
protein tyrosine kinase mRNA, 
NK4 mRNA, tnatural Killer cal 
MHO class ill HSP70-2 oen8\(H 
integrin aipha-3 chain mRNA, 
protein phosphatase 2A catalyt 
rtbosomal protein si 6 mRNA, : 
lamlnln 81 dmin mRNA, :lamin 
insulin-like growth factor Wn 
insulln-like grwth (actor bin 
C4b-bin(Jlng protein gene, exon 
major Yo paraneoplastic antlge 
TRPM-2 protein gene, axons 7,8 
Ig rearranged gamma chain mRNA 
Ig rearranged gamma chain mRNA 
secreted cyclophaiivHke prot 
stem C8« leukemia gene produc 
novel growth factor receptor m 
novei growth factor receptor m 
heat shock factor 1 NCTCFS^) m 
ritxjsomai (Wleln S25 mRNA, : 
prosomal protein P30-33K \(pro 
factor Hhomologue mRNA, :"fa 
factor H homologue mRNA, :"fa 
giolJtn gene, 

moeslnmRNA, :moe^n :moesin 
aryl hydrocart>on receptor nucl 
glucocorticoid receptor repres 
\(clona SF1\) hepatocyte growt 
poiyfxjsts locus \(DP1 gene^ m 
cydin mRNA. :cyclin El 
homeobOK 1,4 protein mRNA, :h 
alpha-L-iduronidas \(IDUA\) mR 
ribosomal protein S8 gene, com 
pyrrollne 5-carboxylate reduct 
E18mRNA, 

(^ophittn isofonn \(hCyP3\) 
protein p78 mRNA, :MAP/mlcrot 
CAPL pnrtrtn mRNA, :SlOOcalc 
CAPL protein mRNA, :S100calc 
novel iHXrtein AHNAK mRNA, part 
myeloid cell nuclear different 
Si 9 ribosomal protein mRNA, 
glutamate decart>oxylase \(GAD8 
serotonin receptor gene, :5-h 
cathepsin E \(CTSE\) gene, exo 
v-fos transformation effector 
heparan sulfate y^teoglycan 
phosphoiipase A2 mRNA, rtyros 
kinase \CTTK\) mRNA, :TTKpro 
high mofolifty group box \(SSRP 
Xhybridoma H210>) anti-hepati 
\(hybridoma H210^ anti-hepatJ 
BTF3 protein homologue gene, 
gamma-giutamylcysteine synthet 



av dif pos 
av dIf pos 

av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
av dif pos 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
iog neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
iog neg 
iog neg 
iog neg 
log neg 
log neg 
log neg 
iog neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
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M91670^al 
M94S56_at 

M94e80J_at 

M96233_s„at 

M96326_ma1_at 

M96956_at 

M97796_s_at 

M97815.at 

S34389_« 

S58544_at 

S69115_at 

S69115_at 

S71043^ma1_s_at 

S71043_ma1_s_at 

S73591_at 

S73591_at 

S75463_at 

S77358_al 

S77582_at 

S78234_at 

S78771.s_al 

S79219,5_at 

S79522^at 

S80582_at 

SS2297_at 

S82597_ma1_s_at 

S904e9_at 

U00947„s_at 

U03397_s_at 

l»33g8_at 

U04241_at 

U04313_at 

yD5340_at 

U0815S_s_al 

U08883_at 

U06863_al 

U09117_at 

U09303_at 

U09813_at 

U09953_8t 

U10362„at 

U10492„at 

U12404_at 

U12404_al 

U124e5„at 

U12779_at 

U14391.at 

U14588_at 

U14988_at 

U14969„at 

U14970_at 

U14971_at 

U14971_at 



HUME2EPI 

HUMFABPHA 

HUMFABPHA 

HUMHLAAX 

HUMGSTM4A 

HUMGSTM4A 

HUMAZGDI 

HUMTDGF3A 

HUMID2X 

HUMCRASP02 

HM0X2 

SPAG1 

S89115 

S69115 

S71043 

S71043 

VDUP1 

VDUP1 

S75463 

S77358 

S77582 

S78234 

S78771 

S79219 

S79522 

CNN3 

S82297 

S82597 

FOR 

U00947 

U03397 

TNFSF9 

U04241 

PI5 

CDC20 

U08155 

U0e863 

U06863 

PLCD1 

EFNB1 

ATPSG3 

U09953 

U10382 

HSM0X1 

U12404 

U12404 

U12485 

□12779 

MY01C 

PXH 

□14968 

U14969 

U14970 

U14971 

U14971 



ut^uitin carrier protein \(E2 log nag 

fatty acid binding protsin horn log neg 

fatty add binding protein horn log neg 

MHC class I \(HLA-A*8001 ^ mRN log neg 

glutathione transferase dass log neg 

glutathione transferase dass log neg 

azuroddin gene, 1^ 

Xdone CR-3^ leratocarclnoma log neg 

helix-loop-hellx protein \(ld- log neg 

retinoic acld-blnding protein log neg 

heme oxygenase-2 [human, ludne log neg 

75 kda infertility-related spe log neg 

granulocyte colony-sBmulallng log neg 

granulocyte cdony-stlmulating log neg 

Ig alpha 2=immunoQlobulln A he log neg 

ig alpha 2»lmmunoglobulln A he log neg 

brain-expressed HHCPA78 homolo log neg 

brain-expressed HHCPA78 homolo log neg 

P43=mitochondrial elongation f log neg 

transcript ch21 "ollgomydn sen log neg 
HERVK10/HUMMTV reverse transcr log neg 

nuc2 homoiog [human, flbroblas log neg 

NAT^CpG IsJand-assodated gene log neg 

metastasis-associated gene [hu log neg 

ubiquitin cartwxyl esdension p tog neg 

ackSIc calponin (human, kidney log neg 

ma 2-mlcroglobulln {1 1 bp del log neg 

UDP-GalNAc:pDlyp0|kkie log neg 

cytochrome P450 r^udase {hum log neg 
done C4E 32 \(CAC\)nA(GTG\) log neg 
receptor protein 4-1BB mRNA, log neg 
receptor 4-1 SB KgandmRNA, : log neg 
homoiog of Drosophila enhancer log neg 
masphimRNA, rprotease InhibI log neg 
pSSCDCmRNA, :cell division c log neg 
chromosome 1 q subtelomertc seq tog neg 
folllstatln-relaled protein pr log neg 

folllstatin-related protein pr log neg 

phosphoilpase c delta 1 mRNA. log neg 
T cell leukemia LERK-2 \(EPLG2 log neg 
mitochondrial ATP synthase sub log neg 
ribosomal protein L9 mRNA, :r log neg 
GP38b glycoprotein mRNA, log neg 

UoKi pixjteln moxn mRNA, : log neg 
Csa*19mRNA. ^^gneg 
Csa-19mRNA. ^^^^ 
fibosomal protein L35 mRNA, log neg 

MAP Kinase activated pmteln k log neg 
myosln^lC mRNA, :myosln!C log neg 
paxillin mRNA, '.paxlUin :pax log neg 

rtbosomal protein L27a mRNA. log neg 
ribt^mal protein L28 mRNA, : log neg 
ribosomal protein S5 mRNA. :r log neg 
hbosomal protein S9 mRNA, :r log neg 
ribosomal protein S9 mRNA, :r log neg 
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U14972.at 
U14973_at 
UtS177.al 

uie860_at 

U16861_at 

U17077_at 

U17780_ma1_at 

U19247_ma1_s_at 

U19251_s_m 

U20657_at 

U20734_s_at 

U20734_s_at 

U2«J768_ma1_al 

U22376_c(Js2^s_at 

U2243rs_at 

U22970_ma1_s.at 

U22970lma1^s_at 

U241&3_s_at 

U24a89_s_at 

U25789_at 

U27333_al 

U27333_at 

U27831_al 

U29175^at 

U29953„ma1_at 

U30827_s_at 

U30888_at 

U30S88_at 

U31814_at 

U31875_at 

U32944_at 

U34B80_st 

U36341_ma1_at 

U38764l«t 

U37012_at 

U37146_at 

U37408_al 

U37689_at 

U38276_8t 

U38276_al 

U39400_at 

U40998.at 

U41C»0„al 

U41768„s_at 

U42359_at 

U43328_at 

U43901_ma1_s_at 

U43901_ma1_s_at 

U45448_s_at 

U4a705_ma1_s^at 

U4B936lat 

U4893e_at 

U49395_at 

U49889_ma1_at 



U14972 ribosomal protein SI 0 mRN A, 

U14973 rtbosomai protein S29 mRNA, : 

U15177 cosmid CRWC2015 at D10S289 1 

ECH1 peroxisomal enoyl-CoA hydratas 

U16799 Na,K-ATPas8 beta-1 subunit mRN 

KCNJ2 Inward rectifying potassium ch 

BENE SENEmRNA. :BEN£ protein 

HSLAMB3S1 7 laminin S 83 chain \(LAMB3\) g 

HSINFGRA7 interferon-gamma receptor alpti 

fsjAlP neuronal apoptosis Inhibitory 

USP4 ubiqultin protease MUn|^\) pr 

U20734 transcription factor ]unB SQu 

U20734 transcription factor junB \0u 

U207S8 osteopontin gene, 

y YB \(o^nfiyb\) gene, complete prima 

U22431 hyp>xiaHndudble factor 1 alp 

U22970 Interferon-indudble peptide 

U22970 interferon-inducible pef^e 

U241 83 phosphofructoKlnase \(PFKM9 m 

HSLY0XL7 lysyl oxldase-llke protein gen 

U25789 ribosomal protein L21 mRNA, : 

U27333 alP^ Ml ,3^ fucosyltransfera 

U27333 alpha S(1 ,3^ fucosyltransfera 

U27831 striatum-enriched phosj^atase 

U29175 transcriptional activator \(BR 

P£QP pigment epithelium-derived fac 

U30827 spOdng factor SRp40-3 \(SRp4 

USP1 4 tRNA^uanlne transglycosylase 

USP14 tRNA-guanine transglycosylase 

HDAC2 transcrif^nal regulator homo 

HEP27 Hep27 protein mRNA, :short«-ch 

Pl|s| cyloplssmic d]^in light chain 

U34880 DPH2LmRNA; :DPH2L *'mRNA." co 

y3634t Xq28 cosmid, creatine transpor 

y^754 TGF-beta receptor interacting 

U37012 cleavage and polyadenylation s 

U37148 silencing mediator of retinoid 

CTBP1 pho^oprotein CtBP mRNA, :C- 

P0LR2H RHA px}lymerB8e il subuntt \(hs 

SBMA3F serophorin 11) family homolog 

SEWK3F samaphorin 111 family homolog 

C11orf4 MOF1 mRNA. :chron(WsomB 11 op 

U40998 retinal protein \(HRG4^) mRNA. 

U41080 breast cancer, estrogen regula 

ADAM9 metalloprotease/dislntegrin/cy 

HUMN33S1 0 N33 protein f omd 1 \{N33V gen 

U4332a link protein mRNA, 

U43g0l 37 KD lamlnln receptor precurs 

U43901 37 kD lamlnln receptor precurs 

U45448 $*2x1 receptor mRNA, 

U48705 receptor tyrodne kinase DDR g 

U48938 amilorWe-sensitlv® epithelial 

U48936 amUoride-sensFtive epithelial 

y 49395 ionotropic ATP receptor P2X5a 

UBB uttkiuJtIn gwe, :ub«quitinB 



log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
log neg 
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U50523 at U50523 BRCA2 region, mRNA sequence CG log neg 

U50929~at BHMT b8talne:homocysteine methyltra log neg 

U52154*at KCNJ5 G proteiF>-cou|^ed inwardly rec log neg 

□52154*81 KC»IJ5 G prrtein-coupled Inwardly reo log neg 

U5Z596~s at U52896 adrenal Crekwp homolog \(Creb tog neg 

U53786"at U53788 envoplaWn \(EVPL\) mRNA, :8n log neg 

U55054''at HSKCC K-Cl cotransporter MhKCCl^ m log neg 

U55054'"at HSKCC K-CI cotransporter \(hKCC19 m log neg 

U5734l"r at U57341 neurofilament triplet U protei log neg 

U57342~« MLF2 rnyelodysplasla/myeloW leukemi log neg 

U57628''at RPGR retinitis pigmentosa GTPase re log neg 

U58882"at U58882 ribosomal protein S28 mRNA, : log neg 

U60975"at U80975 hybrid rece}«or gp25Q precureo log neg 

U60975"at UB0975 hybrid receptor gp250 fffecurso log neg 

U62739~at BCAT2 branched-chaln amino add amm log neg 

UQZ962~at EiF3S6 lnt-8mRNA, :eukaiyotjc trans log neg 

U8354rat U63541 mRNA expressed In HC/HCC liver log nag 

U64883~at PDCD1 hPD-1 \(hPD-1\) mRNA, tprogra log neg 

UeeoercdsS at ueeoei germllneT-cell recef^orbeta log neg 

U88406"at ~ EFNB3 putative EPH-related PTK recap log neg 

U86816''at SMARCC2 SWI/SNF complex 170 KDa subuni log neg 

U668l6lat SMARCC2 SWVSNF complex 170 KDa subuni log neg 

U67092 s at U67092 ataxls-talanglectasia locus pr log neg 

US7156~aT MEKKS mitogen-acUvated kinase kinas log neg 

U68105"s at HSPABPS13 poly\(A\)-bindlng protein \(PA log neg 

U70732~ma1 at GPT giutamate pyruvate transamlnas log neg 

U70887'*at " SLC21A2 prostaglandin transporter hPGT log neg 

U73379~at U73379 cycftn-selective ubiqultin car log neg 

U73379~at U73379 cydln-selecOve ul^quiUn car log neg 

U73824~at HF4G2 p97mRNA, teukaryotlctransia log neg 

U73843"at U73843 eptthellal^clftotranscriF^ log neg 

U77456~at NAF1L4 nucleosome assembly protein 2 log neg 

U77848""ma1 at U77B45 elastln gene, partial cds and log neg 

U7784e~mars at U77849 elasUn gene, partial cds and ogneg 

U77848''mars~at U77846 elastin gene, partial cds and ogneg 

U78027"ma3""at U78027 Bruton's tyrosine kinase \<BTK log neg 

U78095"at " U78095 placental bikunln mRNA, :Plao log neg 

078678"^ U78878 thloredoxin mRNA, nuclear gene log neg 

U78722lal U78722 zinc finger jwotein 165 \(Zpf1 log neg 

U78735 at U78735 ABC3mRNA, toQneg 

U7925Q''at U79258 done 23719 mRNA sequence. log neg 

U7g280''at U79280 done 23575 mRNA, log neg 

U7S299~al U79299 neuronal olfactomedirKrelated log neg 

Ua0184"ma1 at FLU FUlgene, :nightless I \(Dr log neg 

U81984''at " EPAS1 endothelial PAS domain protein log neg 

U82169"'at FZD9 frizzled homolog MFZDS^ mRNA log neg 

U82169~at FZDB frizzied homdog HFZD3^ mRNA log neg 

U82818''at U82818 UCPSS mRNA, [ogneg 

U83248 at CPNE1 coplne I mRNA, :copjne I :cop og neg 

U83S98_al U83S98 death domain receptor 3 solubl log neg 

U8Q136 at U86136 telomerase-assodated protein It^neg 

U87972"at U87972 NAt>4socltratB dehydrogenase logpos 

UB8984"at ISG20 HEM45mRNA, :interferon sUmu logpos 

U89328''at U89328 bone morphogenetic protein rec logpos 

U9042elat DDXL nuclear RNA hellcase, :nuolea tog pes 

2A4> 
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U90S52_s^at 

U909l3_at 

U80916_at 

Ug4747_at 

U95740_ma1_at 

V00571_ma1^at 

W0572lat 

V00594„s_at 

V01512_ma1„al 

X00274_at 

X00274_at 

XD0351Xat 

X00368jxpt2Lat 

X01677J^at 

X02152_at 

)a2596_0t 

X03068J_al 

X031O0_cds2_at 

X03100_cds2_at 

X03342.at 

X03689_s_at 

X03889_s_at 

X04347_s_st 

X04347_s_at 

X06814_at 

XD6617_at 

X06985_at 

X06985_at 

X07698_at 

X07730_at 

XD7730_at 

X12447_St 

X12e71^ma1_at 

X12671„ma1_at 

X12876_s_at 

X12876_s„at 

X13334_at 

X13S46_ma1_8l 

X13794_ma1_at 

X13794_ma1_st 

Xl4008_malX«^ 

X15940.at 

X16064_st 

Xl6832_at 

X17042_at 

X17208_at 

X51345^at 

X51468_at 

X5ie88_at 

X52003_at 

X52003_at 

X52428_s_al 

X52428_s_al 

XS2851_ma1_^ 

XS2968_al 



U90552 
U909t3 
U90916 

U95740 

HSPCRF 

HSPGK1 

HSTHIO 

HSCFOS 

HSHL07 

HSHL07 

HSAC07 

HSPR0L1 

HSGAPW 

HSLDHAR 

HSBCRR 

HSHLDQWB 

HSHLASBA 

HSHIASBA 

HSRFL32 

HSEFTUR5 

HSEFTURS 

HSUP1R1 

HSUPIR1 

HSRRA 

HSRPS11 

HSOXYGR 

HSOXYGR 

HSKERC15 

HSPSA 

HSPSA 

HSALDQA 

HSHNRNPA 

HSHNRNPA 

HSKERtSA 

HSKER1BA 

HSCD14R 

H8HWiG17G 

HSLDHB1 

HSLDHB1 

HSLYSOZY 

H8RPL31 

HSTUMP 

HSCATHH 

HSHPCP 

HSLLREP3 

HSJUNB 

HSEF2 

HSCYCUNA 

HSPS2f^KN 

HSPS2WKN 

HSCYTK 

HSCYTK 

HSCPH70 

HSL35A 



butyrophilin MBTFS^ mRNA, : 
clone 23865 mRNA sequence. :cl 
cione 23815 mRNA sequence. :cl 
WD repeat protein HAN1 1 mRNA, 
Chromosome 18 BAG cJone CIT987 
gene encoding prepro fomi of c 
mRNA enccxJlng phosplioglycarate 
metallothlonefn from catJmium- 
cellular oncogene o-fos \(comp 
gene for HLA-DR alpha heavy ch 
gene for HLA-DR alpha heavy ch 
b^actln. 

prolactin gene 5* region, 
liver olyceraWehyd^3-phosph 
lactate dehydiX)genase-A MLDH 
bcr \(breakpolnt cluster regi 
HIA^D Class II antigen DQwH 
HLA-SB\(DP^) alpha gene, rHLAr 
HLA-SB\(DP^ alpha gene. :HLA- 
rilxjsomal protein t^- :rit>os 
mRNA fragment for elongation f 
mRNA fragment for elongation f 
liver mRNA fragment DNA blndin 
liver mRNA fragment DNA bindin 
receptor of retinoic ackJ. :r 
nbosomal protein S11. : nbo 
heme oxygenase, :heme oxygena 
heme oxygenase. :heme oxygena 
cytokeratin 15. ;kerstln 15 ; 
prostate spedfic antigen. : 
[Kostate specific ant^en. : 
aldolase A gene HEC 4,1.2,13 
gene for hstertgensous nuclear 
gene for heterogeneous nudear 
mmk fragment for cytoKeratin 
mRNA fragment for cytoksratln 
GD14 myelklcell-spedflcleu 
HMG-1 7 gene for non-htstone ch 
lactate dehydrogenase B gen© e 
lactate dehydrogenase B gene e 
lysozyme gene \(EC 3.2.1, 1T\J. 
ritx>somal protein :ribos 
translationaily controlled tu 
cathepsin H \(EC 3 A22.16^. 
hematopoetic proteoglycan cor 
LLRep3. : LlR8p3 
jun-B JUN-B protein. :jun B p 
elongation factor 2. : elonga 
cyclinA. . 

pS2 protein gene. :tr0foil fac 
pS2 protein gene, drefoll fao 
cytokeraUn 13, : cytokeratln 
cylokeratln 13. : cytokeratin 
cydophilln gene for cydophU 
ribosomai protein L35a. :ribo 



log pes 
log pes 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pes 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
l og pos 
log pos 
log pos 
log pos 
log pos 
l og pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
log pos 
l og pos 
log pos 
log pes 
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X53588_m®1_at 

X5a587lat 

XS3777^al 

X54232^at 

X54887_at 

X54942_at 

X54942_st 

X55005_ma1.at 

X55715„at 

X55954^at 

X56494_at 

»8S87j5_st 

XS8S07_at 

X56841„al 

^9S2Lat 

X57351_8t 

)GW351_at 

X57351_s_at 

X57S09_s_ai 

X57809„s_at 

X57958_at 

X5a07a^at 

X59373_at 

X59798_at 

XB0489_at 

X81587_at 

X62320_at 

X62466_at 

X82466.at 

X5»54_ma1„at 

X6^91_al 

^3358^81 

XS3527_at 

X63629.HI 

X64229_ai 

X64707_at 

XB56101 

X66114_nia1_at 

)®8383_sl 

X88363_al 

X66a99_al 

X67247_niaij^ 

XS7325_®1 

}«7951^st 

X68314_at 

X68314_at 

X68888_/na1_s^ 

X89150„^ 

X69391_8t 

XS9560_at 

X89654_at 

X70940_s^at 

X?0940_s_at 

X730f79_at 

X73358_s_at 



at 



HSINTA6R 

HSINTB4R 

HSU3MR 

HSGLYPIC 

HSCYSTATS 

HSCKSH82 

HSCKSHS2 

HSCERBAR 

HSHUMS3 

HSL17ARP 

HSPKI^12 

HSAUTNOR 

HSOGll 

HSHLAE 

HS23KDHBP 

HS18D 

HS18D 

HS18D 

HSIGVL009 

HSIGVL009 

HSRBPRL7A 

HSGATA3R 

HSH0X4D 

HSPRAD1CY 

HSEF1B 

HSRHOG 

HSEPm 

HSCAMPAT1 

HSCAMPAT1 

HSMECDAG 

HSRPRNA 

HSUGT2BI0 

HSRPL19 

HSPCAD 

HSDEK9 

HSBBC1 

HSS100PCB 

HS20X0C 

H8STHPKD 

HSSTHPKD 

HSEWS 

HSRPS8 

HSP27 

HSPAG 

HSGPGI 

HSGPGl 

HSZNB 

HSRPS18 

HSRFIJ6AA 

HSfWOl 

HSS25 

HSEFAC1A2 

H8EFAC1A2 

HSPIR 

HSAES1 



integrln alpha 6, :)megrin, 
integrin beta 4. : tntegrin b 
L23 putative ribosomal protel 
heparan sulfate proteaglycan 
cystatinS. 

cteshsZ CKsl protein homologue 
ckshs2 Cksi protein t^mologue 
{>erbA-1 thyroid honmone rece 
Hums3 40B ribosomat protein s 
HL23 ribosomal protein homolo 
U gene for Mi-type and M24ype 
autoantlgen NOR-90. 
DSC2 desmocoUlnstype2aand 
HLA-E gene, rmajor hlstocompat 
23 kD highly basic protein. 
1-80 gene from Interferon-indu 
1-8D gene from Interferon-lndu 
1-8D gene from interferon-lndu- 
teaiTanged Irwnunoglobuim lamb 
reairanged Immunoglobulin lamb 
ribosomal lutein L7. rriboso 
hGATA3 trans-actlng T-cell sp 
H0X4D a homeobox protein, :ho 
PRAD1 cydin, :PRAD1 cydln 
elongation fador-i-beta. 
moG GTPase. :ras homolog gen 
eplthelin 1 and 2. : epHheli 
CAMPATH-1 \tCDw52^ antigen. 
CAMPATH-1 \(CDw52>) antigen, 
gene for ye491/CD63 an^en. : 
ribosomal pnjteln Mhomologuo 
UGT2BIO udp glucuronosyltrans 
itoosomal protein LI 9. :ribos 
p cadherin. :cadherin 3, P-<ja 
dekmRNA.:DEKgene 
BBC1 mRNA. 

caldum-Wnding protein S1C30P 
gene for 2-oKogiutarate carrie 
mRNA PCTAiRE-1 f or serine/Uire 
mRHAPCTAIRE-1 for seilneAhre 
EWS mRNA. :Ewng sarcoma break 
rpS8 gene for rib<^mal protel 
p27 mRNA. linterfemn, alpha-i 
proliferBtlon-assodated gene 
glutathione peroxidase-GI. 'g 
glutathione peroxidase-Gl. :g 
ZNF33Bgene. 
rilKJsomal protein SIS. :ribos 
rit»somal protein L6. triboso 
rtio GDP-<iIssociation Inhibfto 
ribosomal protein S26. 
elongation fador 1 aipha-2. 
elongation fador 1 alpha-2. 
encoding Polymeric immunoglobu 
hAES-1 mRNA, lamino-temfiinal e 



log 
log 
log 

iog 
iog 
iog 
log 
log 
iog 
log 
log 
log 
iog 
log 
log 
iog 
log 
log 
log 
iog 
log 
log 
log 
log 
log 
log 
log 
iog 
log 
log 
iog 



pes 
pes 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pes 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pes 
^s 
pos 
pos 
pos 



log 
log pos 



log 



pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
pos 
l og pos 



log 
log 
log 
log 
log 
log 
log 
log 
log 
log 
log 

og 
log 

log 
log 
log 



pos 
pos 
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X73480„at 

X73478_at 

X74819_8l 

X74ai9_al 

X74929.s_at 

X75252_at 

X76534_at 

X76534_at 

X77794„at 

X78982_at 

X79439_at 

X80062_at 

)«0198_at 

X802C>0_8t 

X80822_at 

X80909_8t 

X82693_at 

X82693_at 

X834ie_s.8t 

X83495Lat 

X83492_at 

X83572^at 

X88809_at 

X87159_st 

X87241_at 

Xe9416_at 

X89416_at 

X90846_at 

X91103_at 

X93036_at 

X94583_xpt2_r_al 

X94812_at 

X95404_at 

X95735_at 

X958(»_S_at 

X98482Lat 

X98482_r_at 

X98482^_at 
'X98534_s_at 

X99133_at 

X9968a_al 

Y00082_at 

Y00503_at 

Y0Q705_at 

Y00787_s_at 

Y00787_s„at 

Y00796_at 

YOnSS^at 

Y077SS_at 

Y08374_ma1_€^ 

Y08639_at 

Y089763t 

Y10207_at 

Y10871_at 



HSRPL3A ribosomal protein L3. log pos 

HSPTPM hPTPA mRNA. :hPTPA mRNA log pos 

HSCARTROT caitJIac troponin T. log pos 

HSCARTROT carcHac troportln T. log pos 

HSKRT8 KRT8 keratin 8. :k8ratin 8 :K log pos 

HSPEABP puosf^atidyiethanolamina Inndl log pos 

HSNMB NMB mRNA. :transmembrane glyco log pos 

HSNMB NMS mRNA, :trans»iiembrane glyco log pos 

HSCYC01 cydln G1 . : cydin G1 fog pos 

HSERF2 ERF-2 mRNA, loS POS 

HSRPL1 1 ribosomal protein L1 1 . log POS 

HSN0TCH3 Notch 3 DMA sequence, :Natch log pos 

HSSAMRNA SA mRNA. lo9 pos 

HSMLN64 MUm mRNA. iog pos 

HSMLN62 MIMSZ mRNA. :TNF receptor-asso log pos 

HSPLOHF ORF. 1^9 pos 

HSANAC alpha NAC mRNA. :nasc8rt-polyp log pos 

HSE48 £48 antigen. : E48 antigen log pos 

HSE48 E48 antigen. : E48 antigen log |x>s 

HSPRP2 PrP gene, exon 2. f PrP ""gene log pos 

HSFAS47 F8S/Apo-1 \{clone pCRTMi 1-Fas log pos 

HSFAS47 Fas/Apo-I \(clone pCRTMl 1 -Fas log pos 

HSARSD ARSD gene, complete CDS. :aryl log pos 

HSPEA1 5 major sstrucytic phosf*oprote log pos 

HSSCNN1B beta subunit of epttheilal am log pos 

HSHFATPRO hFat protein. :FAT tumor supp log pos 

HSRNAPPP5 protein phosphatase 5. :prote log pos 

H5RNAPFP5 protein phosphatase 5. ;prote log pos 

HARNAMLK2 mixed lineage Kinase 2. log pos 

HSRNAHR44 Hf44 protein. log pos 

HS1MIAT82 MAT8 protein, iphospholemman- log pos 

HSDBIEX12 dbl/acbp gene exon 1^2. log pos 

HS2CGMPPK type tl cGMP-^^ependent protei log pos 

HSNMCFL1 non-musde type coffRn. :cof log pos 

HSZYXIN2R zyxin. :2yxi'n log pos 

HSDXS protein enccKJed by a candidal log pos 

HSTNNTX1 1 TNNT2 gene exon 11 log pos 

H5TNNTX11 TNNT2geneexon11. logpos 

HSTNMTX11 TNNT2 gene exon 11. logpos 

HSVASP413 VASP gen®, axons 4 to 1 3. :^A log pos 

HSNGALGEN NGAL gene. :lipocalin 2 \(onco log pos 

HSTYL mRNA from TYL gene, iplec^rl log pos 

HSLCA T200 !e«tax:yte common antigen logics 

HSKER19 keratin 1 9. :keretJn 19 : ker tog pos 

HSPSn pstl panoeatJc secretory inh log pos 

HSMDNCF MDNCF \(monocyte4efived naut log pos 

HSMDNCF MDNCF S(monocyteK3erived neut log pos 

HSFLA1A leukocyte-assodated molecule log pos 

HSS1 0OA2 S10(^ gene, exon 1 , 2 and 3. log pos 

HSS1 0OA2 S10QA2 gene, exon 1 , 2 and 3, log pos 

Y08374 gene encoding cartilage GP-39 log pos 

HSTFAC nudearoiphan receptor ROR-b logpos 

HSRNAFEV FEV protein. loSP^s 

HSCD171 CD171pmtein. logpos 

HSTWISTGE twist gene. :twist \(Drosophll log pos 
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Y12670_at 

Z1296^at 

Z19554„s_al 

Z19574_rna1_at 

Z22551_at 

Z2S749_ma1_at 

Z25B84_at 

Z25884_ai 

Z28491„s_at 

228407_at 

228407_St 

Z3Q643_al 

Z32765„8t 

Z35402_ma1_s_at 

23540^ma1_s_at 

Z48501_s_at 

Z48950_at 

Z49107_s„at 

Z49148_5_at 

Z49148_s_at 

Z49835_sjlt 

Z50022_at 

Z59043_s_at 

Z70759_at 

Z80783_at 

ZBD787_al 

Z80787_at 

283804_at 

Z84721_als2_at 

Z84721_cd^_at 

2»3784_at 



HSOBRGRP 

HSRPL41 

HSVIWENTA 

HSCYTOK17 

HSKINEC 

HSRPS7 

HSCLC1MR 

HSCLC1MR 

HSC0MT2 

HSRBPL8 

HSCUCHPRA 

HSCD36G15 

HSECAD3 

HSECAD3 

HSPABPii 

HSHH3X3B 

Z49107 

HSRPU9 

HSRPL29 

HSP2SIS0M 

HSSGP1N15 

HSTRAFRNA 

HSM243 

HSH2BL 

HSH4J 

HSH4J 

HSHDHC7 

HSGG1 

HSGG1 

HS398a22 



Isptin recepttK- gene-related 
homologue to yeast ribosomal 
vimemin gene, :vim8rrtin gene 
gene for cytoKeratin 17, :gene 
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gAIMS 

1. An^od of detennining an expression pattern of a cell sample 
independent of the proportion of submucosal, smooth muscle, 
or connective tissue cdls present, comprising: 

determining expression of one or more g«ies in a sample con^^nsing 
cells, wherein the one or more genes excludes genes which are expressed in the 
submucosal, smooth musde, or connective tissue, whereby a pattan of 
©q^ession is fcraied for ±e sample which is independent of the proportion of 
submucosal, smooth muscle, or connective tissue cells m the sample. 

2. The method of daim 1 wherein the sample comprises epithelial 
or carcinoma cells. 

3 . The method of daim 2 wherein the sample comprises urothelial 
or bladder cancer cells. 

4. A method of determining an expression patten of a cell sample, 
comprising: 

determining expression of one or more genes in a sample comprising 
cells, whereby a first pattern of expression is formed for the sample; 

removing expression of genes which are expressed in submucosal, 
^ooth musde, or connective tissue ceUs from the first pattern of expression, 
wberelty a second pattern of expression is formed, wherein the second pattern 
is indqpendent of die proportion of submucosal, smooth musde, or connective 
tissue cells in the san^sle. 

5. The m^kxi of claim 4 whs^ the ceil sample is an epithdium 
or cardnoma sample. 

6. The method of claim 5 wherdn the cell sample is a urothdium 
or bladder cancer sample. 

7. A method fi>r d^ermining an expression pattern of a urothelium 
or bladder cancer cell, comprising: 

determining expression of one or more genes in a sample comprising 
urothelium or bladder canc^ cells, whereby a first pattern of expression is 
formed; 



WO0Q/S2204 



PCT/IBOT/00367 



subtracting from the first pattern of expresaon a second pattern of 
expression, wherein the second pattern was formed using the one or more 
genes and a sample comprising predominantiy submucosal, smooth muscle, or 
connective tissue ceUs. said step of subtracting forming a third pattern of 
5 expression wWch reflects expression of the urotheUum or bladder cancer ceUs 
independent of the proportion of submucosal, smooth muscle, or connective 
tissue cells present in the sample. 

8. A method of detectmg an invaave tumor in a patient, 

comprising: 

10 detecting in a sample of a body fluid a marker which is more prevalent 

in submucosal, smooth muscle, or connective tissue than in the body fluid, 
wherein the marker is an mRNA or protein expression product of a gene, 
wherein an increased amount of the marker in tiie body fluid indicates a tumor 
which has become invasive in the patient. 
15 9, The method of claim 8 wherem the body fluid is selected from 

the group consisting of blood, plasma, serum, urine, ascites 
fluid, pleural fluid, spinal fluid, sputum, and mucous secretions. 

10. The method of claim 8 wherein the marker is a protein 
characteristic of submucosal, smooth muscle, or connective 

20 tissue, but not found in the body fluid. 

11. A method to diagnose a bladder cancw comprising: 
determiiang a first pattern of expression of one or more genes in a 

bladder tissue sample suspected of bang neoplastic; 

comparing the first pattern of expression to a second and third 
25 reference pattern of expression, wheren the second pattern is of the one or 
more genes in normal urotheUum and the third pattern is of the one or more 
genes m bladder cancer, wher«in a first pattern of expression which is more 
similar to the third pattern than the second indicates neoplasia of the bladder 
tissue sample. 
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12. A method to predict outcome or prescribe treatment of a 
bladder tumor, comprising: 

determining a fiirst pattern of repression of one or more graes in a 

bladder tumor sample; 
5 comparing the first pattern to one or more reference patterns of 

expression detennined for bladder tumors at grades I to IV; 

detennining which of the reference patterns shares maxunum similarity 

with the first pattern, wherein the outcome or treatment appropriate for the 

grade of tumor of the reference pattern witii the maxinami similarity is assigned 
10 to the bladder tumor sample. 

13. A method to determine grade of a bladdea: tumor,, comprising: 
determining a first pattern of expression of one or more genes in a 

bladder tumor sample; 

comparing the first pattern to one or more reference patterns of 
15 expression determined for bladder tumors at grades I to IV; 

determining whidi of the reference patterns shares maximum similarity 
with the first pattern, wherein the grade of the reference pattern with the 
maximum similarity is assigned to the bladder tumor sample, 

14. A method to determine stage of a bladder tumor, comprising: 
20 determinmg a first pattern of repression of one or more genes in a 

bladder tumor sample; 

comparing the first pattern to one or more stage-spedfic reference 

patterns; 

detennimng wWch of the stage-specific reference patterns shares 
25 maximum similarity wHh the first pattern, wherein the stage of the reference 
pattern with the maxinmm similarity is assigned to the bladder tumor sample, 

15. The method of claim 14 wherdn the pattern of expression of 
the bladder tumor sample and the rrfarence patterns comprise 
data on the expression of one or more genes selected fi-om the 

30 group consisting of TCC-rdated genes, bladder papilloma- 

related genes, and invasive TCC-related genes. 
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16. A method of identifying a tissue sample as urotheUal, 
comprising: 

detennining a first pattern of expression of one or more genes in a 
tissue sample; 

5 comparing the first pattern of expression to a second pattern of 

expresaon obtained fi-om normal urothelial cdls; wherein similarity between 
the first and second patterns identifies the tissue sample as urothelial in its 
origin. 

17. The mrthod of claim 16 fbitiier comprising the step of. 

10 comparing the first pattern to one or more third patterns of expression 

obtained from otiier cell types, wherein differences between the first 
and third patterns confirms the suggestion that the tissue sample is 

urothelial in origin. 

18. A method to identiiy a set of genes usefijl for dia^osing, 
15 predicting outcome, or prescribing treatment of a bladdw- 

cancer comprising: 
determining a first pattern of expression of one or more genes in a first 

bladder tissue sample; 

determining a second pattern of expression of the one or more genes 
20 in a second bladder tissue sample, wherein the first bladder tissue sample is a 
nonnal unrthdium sample or an earlier stage or lower grade of bladder tumor 
than the second bladder tissue sample; 

comparing the fii^ pattam of expression to the second pattern of 
expression to identify a first s«* of genes whose expression is increased or 
25 decreased in the second bladder tissue sample relative to tiie first bladder tissue 
sample; 

removing from the first set of genes tiiose genes which are expressed 
in subnajcosal, smooth muscle or connective tissue to produce a second set of 
genes, wherem measurement of expression of tiie s«»nd srt of gwies can be 
30 used &x diagnosing, predicting outcome, or prescribmg treatmmt of a bladder 
cancer; 



19. A method of detennining an expression pattern of a bladder 
tissue sample independent of the proportion of submucosal, 
smooth muscle, or connective tissue cells present, comprising: 

isolating a single-cell suspension of dis^gregated bladder tumor ceUs 
fiom a bladder tissue saiqjle comprising bladder c^lls, and cells of one or more 
of a ceU type selected firom the group consisting of submucosal cells, smooth 
muscle cells, or connective tissue cells; 

determining expression of one or more genes in the single-cell 
suspension, whereby a pattern of expression is formed for the sample wiuch is 
independent of the proportion of submucosal, smooth muscle, or connective 
tissue cells in the bladder tissue sample. 

20. The method of any of claims 1-19 wherein expression of a gene 
is determined by assaying for an mKNA transcribed from the 
gene or a protein translated from an mRNA transcribed from 
the gene. 

21. The method of any of clarnis 1-19 wherein expression of a 
plurality of genes is determined. 

22. A method of screening for candidate therapeutic agents for 
treating bladder cancer, comprismg the steps of: 

contacting bladder tumor cells with a lest compound; 

determining gene repression of one or more genes in the bladder tumor 
cells which have faera contacted with the test compound, wherein expression 
of the one or more g^ dian^ during the development of a bladder cancer; 

identifying a test compound as a candidate therapeutic agrat if it causes 
gene expression of at least one of the one or more genes to change to a level 
which is characteristic of an earlier stage of cancer progression. 

23. Amethod of categorizing a tumor, comprismg the steps of: 
mixing cdls of a plurality of tumors, ^erem the tumors are of a single 

type and of a similar stage or grade to fonn a pool; 

determining expression of one or more genes in the pool; 
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comparing expression of the one or more genes in the pool to 
expression in a test sample derived from a tumor, wherein sJmaarities between 
the test sample esqffesaon and the pool expression pennit categorization of the 
tumor. 

5 24. A method of categoriiing a tumor, comprising the steps of: 

mixing one or more gene products from cells of a plurality of 
tumors, wherein the tumors are of a single type and of a simHar stage or grade 
to form a pool, wherdn the gene product is mRNA or protdn; 

determming expression of one or more genes m the pool by 
10 assaying the gene product in the pool; 

comparing expression of the one or more genes in the pool to 
expression in a test sample derived from a tumor, wherein similarities between 
the test sample expres^on and the pool expression permit categorization of the 
tumor. 

15 25 . A method of categorizing a tumor, comprising the steps of: 

determming expression of one or more genes in a plurality of 

tumors, wheron the tumors are of a single type and of a similar stage or grade; 

combining data determined for the «pression of the one or 

more genes to form a data pool; 
20 comparing expression ofthe one or more genes in the data pool 

to expression in a test sample dorived from a tumor, wherein similarities 

between the test sample expression and the data pool pwnit categorization of 

the tumor. 

26. The ra^od of claim 23, 24, or 25 wherdn the tumors are 
25 bladder tumors. 
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